Artwork

Player FM - Internet Radio Done Right

1,111 subscribers

Checked 2d ago
Đã thêm cách đây tám năm
Nội dung được cung cấp bởi Jon Krohn. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Jon Krohn hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.
Player FM - Ứng dụng Podcast
Chuyển sang chế độ ngoại tuyến với ứng dụng Player FM !
icon Daily Deals

825: Data Contracts: The Key to Data Quality, with Chad Sanderson

1:02:22
 
Chia sẻ
 

Manage episode 444156560 series 1278026
Nội dung được cung cấp bởi Jon Krohn. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Jon Krohn hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.

Data contracts are redefining data quality and governance, and Chad Sanderson, CEO of Gable.ai, joins host Jon Krohn to explain how they can transform your data strategy. He breaks down what data contracts are, how they shift data quality checks closer to production, and why they’re essential for reducing data debt. Chad also highlights how better alignment between data producers and consumers can elevate data reliability and tackle change-management challenges in modern organizations.

This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

In this episode you will learn:

  • What data contracts are and how they define expectations for data quality [03:16]
  • What data contracts look like [09:09]
  • The common misconceptions about data quality when implementing AI [12:55]
  • Chad’s Chief Operator role at Data Quality Camp [19:46]
  • How “shifting left” improves data reliability by addressing issues early [24:17]
  • Why data professionals still struggle with data quality [30:31]
  • How data debt forms and why it leads to complex, inefficient architectures [35:53]
  • How will the role of human oversight evolve in ensuring data quality? [47:12]
  • How can data teams leverage storytelling? [52:33]

Additional materials: www.superdatascience.com/825

  continue reading

1186 tập

Artwork
iconChia sẻ
 
Manage episode 444156560 series 1278026
Nội dung được cung cấp bởi Jon Krohn. Tất cả nội dung podcast bao gồm các tập, đồ họa và mô tả podcast đều được Jon Krohn hoặc đối tác nền tảng podcast của họ tải lên và cung cấp trực tiếp. Nếu bạn cho rằng ai đó đang sử dụng tác phẩm có bản quyền của bạn mà không có sự cho phép của bạn, bạn có thể làm theo quy trình được nêu ở đây https://vi.player.fm/legal.

Data contracts are redefining data quality and governance, and Chad Sanderson, CEO of Gable.ai, joins host Jon Krohn to explain how they can transform your data strategy. He breaks down what data contracts are, how they shift data quality checks closer to production, and why they’re essential for reducing data debt. Chad also highlights how better alignment between data producers and consumers can elevate data reliability and tackle change-management challenges in modern organizations.

This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

In this episode you will learn:

  • What data contracts are and how they define expectations for data quality [03:16]
  • What data contracts look like [09:09]
  • The common misconceptions about data quality when implementing AI [12:55]
  • Chad’s Chief Operator role at Data Quality Camp [19:46]
  • How “shifting left” improves data reliability by addressing issues early [24:17]
  • Why data professionals still struggle with data quality [30:31]
  • How data debt forms and why it leads to complex, inefficient architectures [35:53]
  • How will the role of human oversight evolve in ensuring data quality? [47:12]
  • How can data teams leverage storytelling? [52:33]

Additional materials: www.superdatascience.com/825

  continue reading

1186 tập

Tất cả các tập

×
 
NPUs, AIPC, and Dell’s growing suite of AI products: Shirish Gupta speaks to Jon Krohn about neural processing units and what makes them a go-to tool for AI inference workloads, reasons to move your workloads from the cloud and to your local devices, what the mnemonic AIPC stands for and why it will soon be on everyone’s lips, and he offers a special intro to Dell’s new Pro-AI Studio Toolkit. Hear about several real-world AIPC applications run by Dell’s clients, from detecting manufacturing defects to improving efficiencies for first responders, massively supporting actual life-or-death situations. Additional materials: www.superdatascience.com/877 This episode is brought to you by ODSC , the Open Data Science Conference . Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (03:28) What neural processing units (NPUs) are (23:53) About Dell Pro AI Studio (35:03) Use cases for Dell Pro AI Studio (45:16) How AI development workflows and applications will change (49:01) About Dell’s AI factory ecosystem…
 
Small, simple, accessible: Hugging Face makes a huge contribution to the agentic AI wave with its smolagents. Jon Krohn explores how this small-but-mighty new Python library can act as the best personal assistant you never had. Hear about its features and use cases in this five-minute Friday. Additional materials: www.superdatascience.com/876 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.…
 
Why are semiconductors so essential in this digital age, and how are they made? Jon Krohn speaks to electronics CEO Kai Beckmann about Merck KGaA, Darmstadt, Germany’s intricate manufacturing process, how we can use AI to develop materials that power next-gen AI technologies, and how a chip with the processing power of the human brain might one day be able to run on the power of a low-watt light bulb. Additional materials: www.superdatascience.com/875 This episode is brought to you by the Dell AI Factory with NVIDIA . Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (06:26) How Merck KGaA, Darmstadt, Germany supports groundbreaking developments in AI (13:42) Material science’s biggest challenges for AI (29:55) What heterogeneous integration is (34:37) How optical tech influences the electronics industry (49:04) Navigating upturns and downturns in the semiconductor industry (53:08) How AI regulations benefit humanity…
 
In this Five-Minute Friday, Jon Krohn talks baseball. For decades, coaches have relied on player performance stats to make in-game decisions and refine their season strategies. Now, AI led by Statcast is taking baseball strategy even further, massively broadening analytics data to include pitch, swing and catch trajectories, spin rates, biomechanical information, player matchups, and how to enhance player performances. Listen to the episode to find out what other industries can learn from the “data-friendly” sport of baseball. Additional materials: www.superdatascience.com/874 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.…
 
Natalie Monbiot is an independent advisor and collaborator for projects that concern the “virtual human”, and she is “going all in on the virtual human economy”. Jon Krohn speaks to Natalie about these new ventures, how to mitigate the divide between AI users and nonusers, and how anyone can collaborate with AI without compromising their own creativity. Additional materials: www.superdatascience.com/873 This episode is brought to you by the Dell AI Factory with NVIDIA , by Trainium2, the latest AI chip from AWS and by ODSC, the Open Data Science Conference . Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (07:21) Natalie’s influences for her work (18:30) Will machines surpass human intelligence? (29:08) Using LLMs as collaborators and partners (40:15) How platforms demand user engagement and time (56:54) Natalie Monbiot at Wizly…
 
In this five-minute Friday, Jon Krohn looks into Microsoft’s recent release of Majorana 1, a new quantum processing unit that uses topological qubits, a step away from the fragile qubits currently in use. Get Jon’s thoughts about this “transistor for the quantum age”, potential applications for quantum computing, and why this marks an exciting future for data science and machine learning practitioners. Additional materials: www.superdatascience.com/8 72 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.…
 
Agentic AI, AI success strategies, and why flexibility will be so important to keep up with the AI market: Jon Krohn talks to Richmond Alake about the NoSQL database MongoDB, including why it’s a great addition to your toolkit for developing (agentic) AI applications, with a look under the hood at its native vector database. Richmond also talks about why he expects multi-agent AI architectures to go mainstream in 2025. Additional materials: www.superdatascience.com/871 This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC , the Open Data Science Conference . Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (04:10) How Richmond became a Staff Developer Advocate (07:40) How NoSQL database differs from a relational database (16:50) The advantages of working with the cloud-based MongoDB Atlas (32:26) Richmond’s predictions for agentic AI (40:38) How to create an effective AI strategy…
 
In this Five-Minute Friday, Jon Krohn looks into what he considers the world’s most powerful research tool to date, OpenAI’s Deep Research. Find out how OpenAI trained Deep Research to compile literature reviews of limitless topics, what similar tools are on the market, and where Jon sees the tool as having real-world value including how he uses it daily. Additional materials: www.superdatascience.com/870 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.…
 
Jon Krohn talks to Varun Godbole about AI prompt engineering, generative wisdom, and AI generalists in this episode all about the interrelationships between humans and AI. Additional materials: www.superdatascience.com/869 This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference . Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (10:44) Using deep learning to predict breast cancer (15:55) All about Varun’s Tuning Playbook (29:56) On the explosion of interest and news about AI and data science (46:35) About Varun’s Wise AI…
 
How to start a successful tech company, and how you can get started with DBT, TabPFN and BAML: Jon Krohn rounds up his favorite moments from February in this episode of “In Case You Missed It”. Additional materials: www.superdatascience.com/868 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.…
 
The realities of Agentic AI, AGI, and chatbots that don’t hallucinate: Andriy Burkov talks to Jon Krohn about AI in 2025. Best known for his concise machine learning modelling books, author and AI influencer Andriy Burkov also talks about his latest publication in the series, The Hundred-Page Language Learning Models Book. Additional materials: www.superdatascience.com/867 This episode is brought to you by the Dell AI Factory with NVIDIA . Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (07:38) Andriy’s “triology” of books on machine learning (29:32) On the limitations of AI agents (41:12) On the prospect of artificial general intelligence (AGI) (54:24) On developing a chatbot that doesn’t hallucinate (01:10:07) On open-weight and open-source LLMs…
 
Jon Krohn addresses a question for the ages: How close are we, really, to Jurassic Park? Dallas-based biotech company Colossal Biosciences is developing technology that aims to return previously extinct animals like the dodo and woolly mammoth to earth and, crucially, pull many others like the white rhino back from the brink of extinction. Additional materials: www.superdatascience.com/866 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.…
 
Jon Krohn talks to Cal Al-Dhubaib about the extraordinary success of AI and machine learning solutions provider Pandata, his ironclad hack for any company to define their core values, and how to attract and secure loyal clients. Cal thinks tech professionals make two critical mistakes in their careers: The first is that they too-often enjoy being the gatekeepers of their work rather than educating their clients and coworkers as to the details of their projects and why it benefits the company. The second is that tech professionals don’t show vulnerability, whether that means not knowing a topic or not fully understanding how a business works. This issue, Cal says, can spell the difference between a startup’s success and failure. Learn how tech startups can make an ironclad strategy for their future in this episode of The SuperDataScience Podcast. This episode is brought to you by ODSC , the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (09:32) How to scale a successful data science consultancy (22:25) How Pandata navigates highly regulated environments (27:59) How to tackle tech illiteracy in business (36:32) What skills Cals looks for in new hires (35:56) How to sell on a tech company Additional materials: www.superdatascience.com/865…
 
Jon Krohn investigates OpenAI’s new release, o3-mini, in this five-minute Friday, where he walks through the reasoning model’s capabilities and performance, cross-examining them against other major-league players, DeepSeek-R1, GPT-4o and Claude 3.5 Sonnet. Additional materials: www.superdatascience.com/864 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.…
 
Jon Krohn talks tabular data with Frank Hutter, Professor of Artificial Intelligence at Universität Freiburg in Germany. Despite the great steps that deep learning has made in analysing images, audio, and natural language, tabular data has remained its insurmountable obstacle. In this episode, Frank Hutter details the path he has found around this obstacle even with limited data by using a ground-breaking transformer architecture. Named TabPFN, this approach is vastly outperforming other architectures, as testified by a write up of TabPFN’s capabilities in Nature. Frank talks about his work on version 2 of TabPFN, the architecture’s cross-industry applicability, and how TabPFN is able to return accurate results with synthetic data. This episode is brought to you by ODSC , the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (05:57) All about the TabPFN architecture (21:27) Use cases for Bayesian inference (35:07) On getting published in Nature (44:03) How TabPFN handles time series data (51:52) All about Prior Labs Additional materials: www.superdatascience.com/863…
 
Loading …

Chào mừng bạn đến với Player FM!

Player FM đang quét trang web để tìm các podcast chất lượng cao cho bạn thưởng thức ngay bây giờ. Đây là ứng dụng podcast tốt nhất và hoạt động trên Android, iPhone và web. Đăng ký để đồng bộ các theo dõi trên tất cả thiết bị.

 

icon Daily Deals
icon Daily Deals
icon Daily Deals

Hướng dẫn sử dụng nhanh

Nghe chương trình này trong khi bạn khám phá
Nghe