Pop Quiz, AI: How Do You Test A Thinking Machine? Generative AI 101 podcast

Artwork

Tech Emily Laird

Nội dung được cung cấp bởi Emily Laird. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Emily Laird hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.

Generative AI 101 « »
Pop Quiz, AI: How Do You Test a Thinking Machine?

10M ago 6:08

Chia sẻ

MP3•Trang chủ episode

Nội dung được cung cấp bởi Emily Laird. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Emily Laird hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.

AI keeps bragging about its "reasoning skills," but is it actually getting smarter, or just better at faking it? In this episode, we put AI’s so-called intelligence to the test with hardcore benchmarks—BIG-Bench HARD, TruthfulQA, and more—to see if these models can truly problem-solve or if they're just memorizing answers like a sneaky high schooler. Spoiler: Not all AIs are built the same, and some are way better at bluffing than thinking. Tune in to find out who’s the real deal and who’s just a smooth talker.
Connect with Us: If you enjoyed this episode or have questions, reach out to Emily Laird on LinkedIn. Stay tuned for more insights into the evolving world of generative AI. And remember, you now know more about reasoning models than you did before!

Connect with Emily Laird on LinkedIn

… continue reading

230 tập

#Tech #Emily Laird

Artwork

Pop Quiz, AI: How Do You Test a Thinking Machine?

Generative AI 101

29 subscribers

published 10M ago

Chia sẻ

MP3•Trang chủ episode

Nội dung được cung cấp bởi Emily Laird. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Emily Laird hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.

AI keeps bragging about its "reasoning skills," but is it actually getting smarter, or just better at faking it? In this episode, we put AI’s so-called intelligence to the test with hardcore benchmarks—BIG-Bench HARD, TruthfulQA, and more—to see if these models can truly problem-solve or if they're just memorizing answers like a sneaky high schooler. Spoiler: Not all AIs are built the same, and some are way better at bluffing than thinking. Tune in to find out who’s the real deal and who’s just a smooth talker.
Connect with Us: If you enjoyed this episode or have questions, reach out to Emily Laird on LinkedIn. Stay tuned for more insights into the evolving world of generative AI. And remember, you now know more about reasoning models than you did before!

Connect with Emily Laird on LinkedIn

… continue reading

230 tập

#Tech #Emily Laird

Tất cả các tập

×

Chào mừng bạn đến với Player FM!

Player FM đang quét trang web để tìm các podcast chất lượng cao cho bạn thưởng thức ngay bây giờ. Đây là ứng dụng podcast tốt nhất và hoạt động trên Android, iPhone và web. Đăng ký để đồng bộ các theo dõi trên tất cả thiết bị.

Nghe hơn 500 chủ đề

Hướng dẫn sử dụng nhanh

Podcast hàng đầu

Tạp chí thể thao

Tạp chí kinh tế

KBS WORLD Radio Tiếng Hàn qua phim ảnh

Vietnamese News - NHK WORLD RADIO JAPAN

Tạp chí tiêu điểm

The Present Writer

Podcasts – Life Abroad Podcast

Nghe chương trình này trong khi bạn khám phá