6 - Debate and Imitative Generalization with Beth Barnes | AXRP - the AI X-risk Research Podcast

Content provided by Daniel Filan. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Daniel Filan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://vi.player.fm/legal.

AXRP - the AI X-risk Research Podcast
6 - Debate and Imitative Generalization with Beth Barnes

3y ago 1:58:48

One proposal to train AIs that can be useful is to have ML models debate each other about the answer to a human-provided question, where the human judges which side has won. In this episode, I talk with Beth Barnes about her thoughts on the pros and cons of this strategy, what she learned from seeing how humans behaved in debate protocols, and how a technique called imitative generalization can augment debate. Those who are already quite familiar with the basic proposal might want to skip past the explanation of debate to 13:00, "what problems does it solve and does it not solve".

Link to Beth's posts on the Alignment Forum: alignmentforum.org/users/beth-barnes

Link to the transcript: axrp.net/episode/2021/04/08/episode-6-debate-beth-barnes.html

#Science #Tech #Daniel Filan #Xrisk
