Artwork

Nội dung được cung cấp bởi Turpentine, Erik Torenberg, and Nathan Labenz. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Turpentine, Erik Torenberg, and Nathan Labenz hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.
Player FM - Ứng dụng Podcast
Chuyển sang chế độ ngoại tuyến với ứng dụng Player FM !

Anthropic's Responsible Scaling Policy, with Nick Joseph, from the 80,000 Hours Podcast

2:42:17
 
Chia sẻ
 

Manage episode 441745286 series 3452589
Nội dung được cung cấp bởi Turpentine, Erik Torenberg, and Nathan Labenz. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Turpentine, Erik Torenberg, and Nathan Labenz hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.

In this crosspost from the 80,000 Hours podcast, host Rob Wiblin interviews Nick Joseph, Head of Training at Anthropic, about the company's responsible scaling policy for AI development. The episode delves into Anthropic's approach to AI safety, the growing trend of voluntary commitments from top AI labs, and the need for public scrutiny of frontier model development. The conversation also covers AI safety career advice, with a reminder that 80,000 Hours offers free career advising sessions for listeners. Join us for an insightful discussion on the future of AI and its societal implications.

Apply to join over 400 Founders and Execs in the Turpentine Network: https://www.turpentinenetwork.co/

SPONSORS:

WorkOS: Building an enterprise-ready SaaS app? WorkOS has got you covered with easy-to-integrate APIs for SAML, SCIM, and more. Join top startups like Vercel, Perplexity, Jasper & Webflow in powering your app with WorkOS. Enjoy a free tier for up to 1M users! Start now at https://bit.ly/WorkOS-Turpentine-Network

Weights & Biases Weave: Weights & Biases Weave is a lightweight AI developer toolkit designed to simplify your LLM app development. With Weave, you can trace and debug input, metadata and output with just 2 lines of code. Make real progress on your LLM development and visit the following link to get started with Weave today: https://wandb.me/cr

80,000 Hours: 80,000 Hours offers free one-on-one career advising for Cognitive Revolution listeners aiming to tackle global challenges, especially in AI. They connect high-potential individuals with experts, opportunities, and personalized career plans to maximize positive impact. Apply for a free call at https://80000hours.org/cognitiverevolution to accelerate your career and contribute to solving pressing AI-related issues.

Omneky: Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off https://www.omneky.com/

RECOMMENDED PODCAST:

This Won't Last - Eavesdrop on Keith Rabois, Kevin Ryan, Logan Bartlett, and Zach Weinberg's monthly backchannel ft their hottest takes on the future of tech, business, and venture capital.

Spotify: https://open.spotify.com/show/2HwSNeVLL1MXy0RjFPyOSz

CHAPTERS:

(00:00:00) About the Show

(00:00:22) Sponsors: WorkOS

(00:01:22) About the Episode

(00:04:31) Intro and Nick's background

(00:08:37) Model training and scaling laws

(00:13:10) Nick's role at Anthropic

(00:16:49) Responsible Scaling Policies overview (Part 1)

(00:18:00) Sponsors: Weights & Biases Weave | 80,000 Hours

(00:20:39) Responsible Scaling Policies overview (Part 2)

(00:25:24) AI Safety Levels framework

(00:30:33) Benefits of RSPs (Part 1)

(00:33:15) Sponsors: Omneky

(00:33:38) Benefits of RSPs (Part 2)

(00:36:32) Concerns about RSPs

(00:47:33) Sandbagging and evaluation challenges

(00:54:46) Critiques of RSPs

(01:03:11) Trust and accountability

(01:12:03) Conservative vs. aggressive approaches

(01:17:43) Capabilities vs. safety research

(01:23:47) Working at Anthropic

(01:35:14) Nick's career journey

(01:45:12) Hiring at Anthropic

(01:52:06) Concerns about AI capabilities work

(02:03:38) Anthropic office locations

(02:08:46) Pressure and stakes at Anthropic

(02:18:09) Overrated and underrated AI applications

(02:35:57) Closing remarks

(02:38:33) Sponsors: Outro

  continue reading

205 tập

Artwork
iconChia sẻ
 
Manage episode 441745286 series 3452589
Nội dung được cung cấp bởi Turpentine, Erik Torenberg, and Nathan Labenz. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Turpentine, Erik Torenberg, and Nathan Labenz hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.

In this crosspost from the 80,000 Hours podcast, host Rob Wiblin interviews Nick Joseph, Head of Training at Anthropic, about the company's responsible scaling policy for AI development. The episode delves into Anthropic's approach to AI safety, the growing trend of voluntary commitments from top AI labs, and the need for public scrutiny of frontier model development. The conversation also covers AI safety career advice, with a reminder that 80,000 Hours offers free career advising sessions for listeners. Join us for an insightful discussion on the future of AI and its societal implications.

Apply to join over 400 Founders and Execs in the Turpentine Network: https://www.turpentinenetwork.co/

SPONSORS:

WorkOS: Building an enterprise-ready SaaS app? WorkOS has got you covered with easy-to-integrate APIs for SAML, SCIM, and more. Join top startups like Vercel, Perplexity, Jasper & Webflow in powering your app with WorkOS. Enjoy a free tier for up to 1M users! Start now at https://bit.ly/WorkOS-Turpentine-Network

Weights & Biases Weave: Weights & Biases Weave is a lightweight AI developer toolkit designed to simplify your LLM app development. With Weave, you can trace and debug input, metadata and output with just 2 lines of code. Make real progress on your LLM development and visit the following link to get started with Weave today: https://wandb.me/cr

80,000 Hours: 80,000 Hours offers free one-on-one career advising for Cognitive Revolution listeners aiming to tackle global challenges, especially in AI. They connect high-potential individuals with experts, opportunities, and personalized career plans to maximize positive impact. Apply for a free call at https://80000hours.org/cognitiverevolution to accelerate your career and contribute to solving pressing AI-related issues.

Omneky: Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off https://www.omneky.com/

RECOMMENDED PODCAST:

This Won't Last - Eavesdrop on Keith Rabois, Kevin Ryan, Logan Bartlett, and Zach Weinberg's monthly backchannel ft their hottest takes on the future of tech, business, and venture capital.

Spotify: https://open.spotify.com/show/2HwSNeVLL1MXy0RjFPyOSz

CHAPTERS:

(00:00:00) About the Show

(00:00:22) Sponsors: WorkOS

(00:01:22) About the Episode

(00:04:31) Intro and Nick's background

(00:08:37) Model training and scaling laws

(00:13:10) Nick's role at Anthropic

(00:16:49) Responsible Scaling Policies overview (Part 1)

(00:18:00) Sponsors: Weights & Biases Weave | 80,000 Hours

(00:20:39) Responsible Scaling Policies overview (Part 2)

(00:25:24) AI Safety Levels framework

(00:30:33) Benefits of RSPs (Part 1)

(00:33:15) Sponsors: Omneky

(00:33:38) Benefits of RSPs (Part 2)

(00:36:32) Concerns about RSPs

(00:47:33) Sandbagging and evaluation challenges

(00:54:46) Critiques of RSPs

(01:03:11) Trust and accountability

(01:12:03) Conservative vs. aggressive approaches

(01:17:43) Capabilities vs. safety research

(01:23:47) Working at Anthropic

(01:35:14) Nick's career journey

(01:45:12) Hiring at Anthropic

(01:52:06) Concerns about AI capabilities work

(02:03:38) Anthropic office locations

(02:08:46) Pressure and stakes at Anthropic

(02:18:09) Overrated and underrated AI applications

(02:35:57) Closing remarks

(02:38:33) Sponsors: Outro

  continue reading

205 tập

सभी एपिसोड

×
 
Loading …

Chào mừng bạn đến với Player FM!

Player FM đang quét trang web để tìm các podcast chất lượng cao cho bạn thưởng thức ngay bây giờ. Đây là ứng dụng podcast tốt nhất và hoạt động trên Android, iPhone và web. Đăng ký để đồng bộ các theo dõi trên tất cả thiết bị.

 

Hướng dẫn sử dụng nhanh