77 subscribers
Chuyển sang chế độ ngoại tuyến với ứng dụng Player FM !
Podcast đáng để nghe
TÀI TRỢ BỞI


1 Unlocking Your Hidden Genius: How to Harness Your Innate Talents with Betsy Wills & Alex Ellison | Ep. 289 32:08
Automating Software Engineering: Genie Tops SWE-Bench, w/ Alistair Pullen, from Latent.Space podcast
Manage episode 443195566 series 3452589
In this special crossover episode of The Cognitive Revolution, Nathan shares an insightful conversation from the Latent.Space podcast. Swyx and Alessio interview Alistair Pullen of Cosine, creators of Genie, showcasing the cutting edge of AI automation in software engineering. Learn how Cosine achieves state-of-the-art results on the SWE-bench benchmark by implementing advanced AI techniques. This episode complements Nathan's recent discussion on AI Automation, demonstrating how far these practices can be pushed in real-world applications. Don't miss this opportunity to explore the future of AI-driven software development and its implications for businesses across industries.
Check out the Latent.Space podcast here: https://www.latent.space
Apply to join over 400 Founders and Execs in the Turpentine Network: https://www.turpentinenetwork.co/
SPONSORS:
WorkOS: Building an enterprise-ready SaaS app? WorkOS has got you covered with easy-to-integrate APIs for SAML, SCIM, and more. Join top startups like Vercel, Perplexity, Jasper & Webflow in powering your app with WorkOS. Enjoy a free tier for up to 1M users! Start now at https://bit.ly/WorkOS-Turpentine-Network
Weights & Biases Weave: Weights & Biases Weave is a lightweight AI developer toolkit designed to simplify your LLM app development. With Weave, you can trace and debug input, metadata and output with just 2 lines of code. Make real progress on your LLM development and visit the following link to get started with Weave today: https://wandb.me/cr
80,000 Hours: 80,000 Hours offers free one-on-one career advising for Cognitive Revolution listeners aiming to tackle global challenges, especially in AI. They connect high-potential individuals with experts, opportunities, and personalized career plans to maximize positive impact. Apply for a free call at https://80000hours.org/cognitiverevolution to accelerate your career and contribute to solving pressing AI-related issues.
Omneky: Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off https://www.omneky.com/
CHAPTERS:
(00:00:00) About the Show
(00:00:22) Sponsors: WorkOS
(00:01:22) About the Episode
(00:04:29) Alistair and Cosine intro
(00:13:50) Building the Code Retrieval Tool
(00:17:36) Sponsors: Weights & Biases Weave | 80,000 Hours
(00:20:15) Developing Genie and Fine-tuning Process
(00:27:41) Working with Customer Data
(00:30:53) Code Retrieval Challenges and Solutions
(00:36:39) Sponsors: Omneky
(00:37:02) Planning and Reasoning in AI Models
(00:45:55) Language Support and Generalization
(00:49:46) Fine-tuning Experience with OpenAI
(00:52:56) Synthetic Data and Self-improvement Loop
(00:55:57) Benchmarking and SWE-bench Results
(01:01:47) Future Plans for Genie
(01:03:02) Industry Trends and Cursor's Success
(01:05:23) Calls to Action and Ideal Customers
(01:08:43) Outro
231 tập
Automating Software Engineering: Genie Tops SWE-Bench, w/ Alistair Pullen, from Latent.Space podcast
"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis
Manage episode 443195566 series 3452589
In this special crossover episode of The Cognitive Revolution, Nathan shares an insightful conversation from the Latent.Space podcast. Swyx and Alessio interview Alistair Pullen of Cosine, creators of Genie, showcasing the cutting edge of AI automation in software engineering. Learn how Cosine achieves state-of-the-art results on the SWE-bench benchmark by implementing advanced AI techniques. This episode complements Nathan's recent discussion on AI Automation, demonstrating how far these practices can be pushed in real-world applications. Don't miss this opportunity to explore the future of AI-driven software development and its implications for businesses across industries.
Check out the Latent.Space podcast here: https://www.latent.space
Apply to join over 400 Founders and Execs in the Turpentine Network: https://www.turpentinenetwork.co/
SPONSORS:
WorkOS: Building an enterprise-ready SaaS app? WorkOS has got you covered with easy-to-integrate APIs for SAML, SCIM, and more. Join top startups like Vercel, Perplexity, Jasper & Webflow in powering your app with WorkOS. Enjoy a free tier for up to 1M users! Start now at https://bit.ly/WorkOS-Turpentine-Network
Weights & Biases Weave: Weights & Biases Weave is a lightweight AI developer toolkit designed to simplify your LLM app development. With Weave, you can trace and debug input, metadata and output with just 2 lines of code. Make real progress on your LLM development and visit the following link to get started with Weave today: https://wandb.me/cr
80,000 Hours: 80,000 Hours offers free one-on-one career advising for Cognitive Revolution listeners aiming to tackle global challenges, especially in AI. They connect high-potential individuals with experts, opportunities, and personalized career plans to maximize positive impact. Apply for a free call at https://80000hours.org/cognitiverevolution to accelerate your career and contribute to solving pressing AI-related issues.
Omneky: Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off https://www.omneky.com/
CHAPTERS:
(00:00:00) About the Show
(00:00:22) Sponsors: WorkOS
(00:01:22) About the Episode
(00:04:29) Alistair and Cosine intro
(00:13:50) Building the Code Retrieval Tool
(00:17:36) Sponsors: Weights & Biases Weave | 80,000 Hours
(00:20:15) Developing Genie and Fine-tuning Process
(00:27:41) Working with Customer Data
(00:30:53) Code Retrieval Challenges and Solutions
(00:36:39) Sponsors: Omneky
(00:37:02) Planning and Reasoning in AI Models
(00:45:55) Language Support and Generalization
(00:49:46) Fine-tuning Experience with OpenAI
(00:52:56) Synthetic Data and Self-improvement Loop
(00:55:57) Benchmarking and SWE-bench Results
(01:01:47) Future Plans for Genie
(01:03:02) Industry Trends and Cursor's Success
(01:05:23) Calls to Action and Ideal Customers
(01:08:43) Outro
231 tập
Tất cả các tập
×
1 Shortwave Rides the Tidal Wave: Inbox Agents, Hyper-Growth & Hiring AI Managers, with CEO Andrew Lee 1:51:39

1 Code Context is King: Augment’s AI Assistant for Professional Software Engineers, with Guy Gur-Ari 1:25:44

1 Unlocking Cells' Secrets: Diffusion, Deconvolution, & Discovery with Siyu He, author of Squidiff & CORAL 1:46:17

1 a16z on AI Voices: Call Centers, Coaches, and Companions with Olivia Moore & Anish Acharya 1:07:35

1 Agency over AI? Allan Dafoe on Technological Determinism & DeepMind's Safety Plans, from 80000 Hours 3:02:28

1 China's Tech Tightrope: Power, Regulation, and the AI Race with Angela Zhang 1:31:56

1 Historic AI Developments & the Emerging Shape of Superintelligence, from the Consistently Candid Podcast 1:57:36

1 Frontier Models for Frontier Science with Professor Derya Unutmaz, Immunologist & ChatGPT Pro Grantee 1:32:34

1 US-China Relations: History, Culture, and AI Competition, with Noah Smith, from Econ 102 1:09:49

1 The Adversarial Mind: Defeating AI Defenses with Nicholas Carlini of Google DeepMind 2:34:38

1 New Jersey’s AI Moonshot: Governor Phil Murphy on Partnerships, Progress, and Preparedness 55:54

1 Inference Scaling, Alignment Faking, Deal Making? Frontier Research with Ryan Greenblatt of Redwood Research 3:21:07

1 An Application-Free Future? Speaking Directly to Data with illumex CEO Inna Tokarev Sela 1:31:26

1 Claude Cooperates! Exploring Cultural Evolution in LLM Societies, with Aron Vallinder & Edward Hughes 1:32:52

1 Software Supernova: Lovable's "Superhuman Full Stack Engineer" to Transform Idea to App in Seconds 1:34:53
Chào mừng bạn đến với Player FM!
Player FM đang quét trang web để tìm các podcast chất lượng cao cho bạn thưởng thức ngay bây giờ. Đây là ứng dụng podcast tốt nhất và hoạt động trên Android, iPhone và web. Đăng ký để đồng bộ các theo dõi trên tất cả thiết bị.