This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
…
continue reading
Are you on top of the latest innovations in data, analytics, and AI? With data being pivotal to strategy and change, the Data-powered Innovation Jam podcast gives you the key to some of the most crucial aspects of business success. Through our guests, we bring you the latest trends from the world of data and AI, discussing the best ideas and experiences. Our hosts with their decades of profound experience and a background in avant-garde music, will also explore the edges of jazz, rock, and p ...
…
continue reading
Little Fluffy PolyClouds: The Data Engineering Playbook is your essential guide to building cloud-agnostic data infrastructure. We provide practical, step-by-step strategies for designing and deploying resilient data systems across all major platforms, including AWS, Azure, and GCP.
…
continue reading
Hi, we’re Tim Berglund, Adi Polak, and Viktor Gamov and we’re excited to bring you the Confluent Developer podcast (formerly “Streaming Audio.”) Our hand-crafted weekly episodes feature in-depth interviews with our community of software developers (actual human beings - not AI) talking about some of the most interesting challenges they’ve faced in their careers. We aim to explore the conditions that gave rise to each person’s technical hurdles, as well as how their experiences transformed th ...
…
continue reading
Independent contractor software developer and cloud platform engineer. Podcast and music by Pilgrim Engineering Architecture Technology PEAT UK
…
continue reading
Hosted by Viktor Gamov and Kaitlyn Barnard, we interview software developers and technology leaders at the top of their game every other week. We’ll also give you the tools, tactics and strategies you need to take your cloud native architecture to the next level. We go beyond the buzzwords and dissect real-life applications and success stories so that you can tackle your biggest connectivity challenges.
…
continue reading
What does the future of AI sound like? In this special year-end episode of Data-powered Innovation Jam, we riff on seven bold predictions for 2026, from security-first AI and multi-agent ecosystems to industry-native intelligence and even synthetic curiosity. Join hosts Ron Tolido and Robert Engels as they jam with thought leaders on trends that wi…
…
continue reading
1
Decreasing Java Build Times with Pratik Patel | Ep. 10
25:56
25:56
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
25:56Tim Berglund talks to Pratik Patel (Azul Systems) about his career in developer relations and Java. Pratik’s first job: computer lab assistant at UNC Chapel Hill. His challenge: working at a large enterprise with manual, slow build processes and transforming them through automation. SEASON 2 Hosted by Tim Berglund, Adi Polak and Viktor Gamov Produc…
…
continue reading
1
Blurring Lines: Data, AI, and the New Playbook for Team Velocity
1:00:57
1:00:57
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:00:57Summary In this crossover episode, Max Beauchemin explores how multiplayer, multi‑agent engineering is transforming the way individuals and teams build data and AI systems. He digs into the shifting boundary between data and AI engineering, the rise of “context as code,” and how just‑in‑time retrieval via MCP and CLIs lets agents gather what they n…
…
continue reading
Bringing you a very special edition of the Data Powered Innovation Jam podcast, recorded live during a tech road trip through San Francisco and Silicon Valley. This episode blends the city’s musical heritage with cutting-edge innovation, exploring how creativity and technology intersect. Hosts Robert Engels, together with our ‘tech guy’ Alex Bulat …
…
continue reading
1
Reimagining Stream Processing with Matthias J. Sax | Ep. 9
36:42
36:42
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
36:42Viktor Gamov talks to Matthias J. Sax (Confluent) about his career in stream processing and, specifically, Kafka Streams. Matthias’ first job: an electrician-in-training on BMW’s assembly lines. His challenge: building Kafka Streams at Confluent with a focus on API design, backward compatibility, and a library-first approach that also fits microser…
…
continue reading
1
State, Scale, and Signals: Rethinking Orchestration with Durable Execution
51:46
51:46
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
51:46Summary In this episode Preeti Somal, EVP of Engineering at Temporal, talks about the durable execution model and how it reshapes the way teams build reliable, stateful systems for data and AI. She explores Temporal’s code‑first programming model—workflows, activities, task queues, and replay—and how it eliminates hand‑rolled retry, checkpoint, and…
…
continue reading
1
How Time Kills All Deals in Pre-Sales with Rachel Pedreschi | Ep. 8
27:40
27:40
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
27:40Listen: https://confluent.buzzsprout.com | In this episode, Tim Berglund talks to his guest, Rachel Pedreschi (DeltaStream), about her career in pre-sales engineering. Her first job: rectory office assistant at her local parish. Her challenge/theme: working at early-stage startups to bridge sales, marketing, and engineering to reach product-market …
…
continue reading
1
The AI Data Paradox: High Trust in Models, Low Trust in Data
51:35
51:35
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
51:35Summary In this episode of the Data Engineering Podcast Ariel Pohoryles, head of product marketing for Boomi's data management offerings, talks about a recent survey of 300 data leaders on how organizations are investing in data to scale AI. He shares a paradox uncovered in the research: while 77% of leaders trust the data feeding their AI systems,…
…
continue reading
1
Scaling AI in Engineering with Peter Bell | Ep. 7
27:16
27:16
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
27:16Listen: https://confluent.buzzsprout.com | Today, Adi Polak talks to her guest, Peter Bell (gather.dev), about his career in software engineering leadership, CTO community building, and AI-driven development. Peter’s first job: electronics lab technician at their school (alongside shifts at Tesco). His challenge/theme: working at scale with AI adop…
…
continue reading
1
Bridging the AI–Data Gap: Collect, Curate, Serve
50:40
50:40
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
50:40Summary In this episode of the Data Engineering Podcast Omri Lifshitz (CTO) and Ido Bronstein (CEO) of Upriver talk about the growing gap between AI's demand for high-quality data and organizations' current data practices. They discuss why AI accelerates both the supply and demand sides of data, highlighting that the bottleneck lies in the "middle …
…
continue reading
1
How Kafka Expert Robin Moffat Tackles Open Source Problems | Ep. 6
24:50
24:50
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
24:50Today, Viktor Gamov talks to his colleague Robin Moffat (Confluent) about his career in data engineering. His first job: paperboy. His challenge: working at a retailer with Oracle materialized views as well as teaching others how to productively approach Kafka’s internal systems. Blog posts mentioned in the podcast: ► Oracle Materialized Views trou…
…
continue reading
1
Beyond the Perimeter: Practical Patterns for Fine‑Grained Data Access
1:05:00
1:05:00
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:05:00Summary In this episode of the Data Engineering Podcast Matt Topper, president of UberEther, talks about the complex challenge of identity, credentials, and access control in modern data platforms. With the shift to composable ecosystems, integration burdens have exploded, fracturing governance and auditability across warehouses, lakes, files, vect…
…
continue reading
1
Episode 3: The Pipeline Pit Crew: Monitoring, Troubleshooting, and Optimizing Your AWS Data
12:36
12:36
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
12:36Keep your data pipelines running smoothly! This episode covers Domain 3 (22% of the DEA-C01 exam). We dive into setting up alarms with CloudWatch, troubleshooting stuck jobs with Glue Logs, optimizing performance and cost in Redshift, and ensuring data quality with AWS Glue DataBrew.Bởi James
…
continue reading
Where should you put your data? We tackle Domain 2 (26% of the DEA-C01 exam) by comparing Redshift, DynamoDB, and RDS. Learn how to design optimal schemas, use the AWS Glue Data Catalog, and implement S3 Lifecycle Policies to manage data lifespan and control costs.Bởi James
…
continue reading
1
Episode 4: The Data Fortress: Securing and Governing Data for the DEA-C01
12:20
12:20
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
12:20Lock down your data platform! This is the final domain, Domain 4 (18% of the DEA-C01 exam). We cover essential security best practices: using IAM and Lake Formation for access control, enforcing encryption with KMS (at rest and in transit), and securing network access via VPC and Security Groups.Bởi James
…
continue reading
1
Episode 1: Mastering the AWS Data Assembly Line
18:05
18:05
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
18:05This is the essential guide to Domain 1: Data Ingestion and Transformation—the biggest section (34%) of the AWS Certified Data Engineer - Associate (DEA-C01) exam! We break down the core components of a successful data pipeline. Learn to compare Batch vs. Streaming with services like Kinesis and DMS, master ETL/ELT using AWS Glue and EMR, and orche…
…
continue reading
In this genre-blending episode of Data Powered Innovation Jam, hosts Ron Tolido, Robert Engels, and Arne Rossman welcome Stephen Brobst, CTO of Ab Initio and former CTO of Terradata, for a deep dive into the art of mixing data, AI, and music. From punk rock roots and stage-diving legends to the reinvention of enterprise data platforms, Stephen shar…
…
continue reading
1
Building Parquet into Apache Pinot ft. Neha Pawar | Ep. 5
26:07
26:07
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
26:07Today, Tim Berglund talks to Neha Pawar (StarTree) about her career in real-time analytics and open source database engineering. Her first job: a year-long internship at NVIDIA. Her challenge: leading the technical effort to add native Parquet support into Apache Pinot. SEASON 2 Hosted by Tim Berglund, Adi Polak and Viktor Gamov Produced and Edited…
…
continue reading
1
The True Costs of Legacy Systems: Technical Debt, Risk, and Exit Strategies
1:04:16
1:04:16
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:04:16Summary In this episode Kate Shaw, Senior Product Manager for Data and SLIM at SnapLogic, talks about the hidden and compounding costs of maintaining legacy systems—and practical strategies for modernization. She unpacks how “legacy” is less about age and more about when a system becomes a risk: blocking innovation, consuming excess IT time, and cr…
…
continue reading
1
The Fix That Secured 1000s of Credit Cards ft. Brian Sletten | Ep. 4
29:37
29:37
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
29:37In this episode, Tim talks to Brian Sletten (Bosatsu Consulting) about his career in software development. His first job: working at a small communications company that built network matrix switch interfaces. His challenge/theme: overhauling credit card storage and security at a major hospitality company. SEASON 2 Hosted by Tim Berglund, Adi Polak …
…
continue reading
1
Context Engineering as a Discipline: Building Governed AI Analytics
51:58
51:58
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
51:58Summary In this episode of the Data Engineering Podcast, host Tobias Macey welcomes back Nick Schrock, CTO and founder of Dagster Labs, to discuss Compass - a Slack-native, agentic analytics system designed to keep data teams connected with business stakeholders. Nick shares his journey from initial skepticism to embracing agentic AI as model and a…
…
continue reading
Welcome to the latest episode of the Data Powered Innovation Jam, where data meets disco and AI grooves with funk. After a long summer break, our hosts return with fresh stories, musical nostalgia, and cutting-edge insights into the world of supply chain superintelligence. In this vibrant and eclectic episode, we’re joined by Guillaume Waline, Seni…
…
continue reading
1
How Viktor Gamov Stays Curious as Tech Rapidly Evolves | Ep. 3
30:11
30:11
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
30:11Adi Polak interviews her co-host, Viktor Gamov, about his career’s evolution from distributed systems to streaming technology. Viktor’s first job: apple picking. His challenge/theme: staying curious and non-judgmental in the ever-changing landscape of tech. SEASON 2 Hosted by Tim Berglund, Adi Polak and Viktor Gamov Produced and Edited by Noelle Ga…
…
continue reading
1
The Data Model That Captures Your Business: Metric Trees Explained
1:01:05
1:01:05
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:01:05Summary In this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace, talks about metric trees - a new approach to data modeling that directly captures a company's business model. Vijay shares insights from his decade-long experience building data practices at Rent the Runway and explains how the modern data stack has…
…
continue reading
1
How Tim Berglund Found His Calling | Ep. 2
30:36
30:36
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
30:36Viktor Gamov interviews his co-host, Tim Berglund, about his career in the world of streaming data. Tim’s first job: Burger King broiler steamer. His challenge/theme: pivoting from working in hardware and firmware to finding his calling in enterprise software and developer relations. SEASON 2 Hosted by Tim Berglund, Adi Polak and Viktor Gamov Produ…
…
continue reading
1
From GPUs-as-a-Service to Workloads-as-a-Service: Flex AI’s Path to High-Utilization AI Infra
56:31
56:31
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
56:31Summary In this crossover episode of the AI Engineering Podcast, host Tobias Macey interviews Brijesh Tripathi, CEO of Flex AI, about revolutionizing AI engineering by removing DevOps burdens through "workload as a service". Brijesh shares his expertise from leading AI/HPC architecture at Intel and deploying supercomputers like Aurora, highlighting…
…
continue reading
1
Building Real-time Systems for Apple, Nike & more ft. Adi Polak | Ep. 1
32:53
32:53
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
32:53The Confluent Developer Podcast is here! For this first episode, Tim Berglund talks to his co-host, Adi Polak (Confluent), about her career in distributed data systems. Her first job: neighborhood dogwalker. Her challenge/theme: early Hadoop, working at Akamai on data optimization and real-time threat detection for huge global customers like Apple,…
…
continue reading
1
From RAG to Relational: How Agentic Patterns Are Reshaping Data Architecture
52:58
52:58
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
52:58Summary In this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at AWS, talks about how agentic workflows are transforming database usage and infrastructure design. He discusses the evolving role of data in AI systems, from traditional models to more modern approaches like vectors, RAG, and relational databases. Ma…
…
continue reading
1
Duck Lake: Simplifying the Lakehouse Ecosystem
1:10:41
1:10:41
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:10:41Summary In this episode of the Data Engineering Podcast Hannes Mühleisen and Mark Raasveldt, the creators of DuckDB, share their work on Duck Lake, a new entrant in the open lakehouse ecosystem. They discuss how Duck Lake, is focused on simplicity, flexibility, and offers a unified catalog and table format compared to other lakehouse formats like I…
…
continue reading
1
We're back! Welcome to the Confluent Developer Podcast.
1:20
1:20
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:20Weekly episodes launching Sept. 22! | Hi, I'm Tim Berglund. It's been about four years since I've been podcasting at Confluent, and "Streaming Audio" has been on hiatus for a little more than two, but I've got great news: we are back! We're back with a new name, a new format, and new hosts. Welcome to the Confluent Developer Podcast, where we talk …
…
continue reading
1
Aligning Business and Data: The Essential Role of Data Modeling
1:06:51
1:06:51
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:06:51Summary In this episode of the Data Engineering Podcast Serge Gershkovich, head of product at SQL DBM, talks about the socio-technical aspects of data modeling. Serge shares his background in data modeling and highlights its importance as a collaborative process between business stakeholders and data teams. He debunks common misconceptions that dat…
…
continue reading
1
From Academia to Industry: Bridging Data Engineering Challenges
50:54
50:54
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
50:54Summary In this episode of the Data Engineering Podcast Professor Paul Groth, from the University of Amsterdam, talks about his research on knowledge graphs and data engineering. Paul shares his background in AI and data management, discussing the evolution of data provenance and lineage, as well as the challenges of data integration. He explores t…
…
continue reading
1
High Performance And Low Overhead Graphs With KuzuDB
1:01:29
1:01:29
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:01:29Summary In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB, talks about their embeddable graph database. Prashanth explains how KuzuDB addresses performance shortcomings in existing solutions through columnar storage and novel join algorithms. He discusses the usability and scalability of KuzuDB, emphasizing its…
…
continue reading
1
Bridging Data and Decision-Making: AI's Role in Modern Analytics
1:10:44
1:10:44
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
1:10:44Summary In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity talk about their development of Orion, an autonomous data analyst that bridges the gap between data availability and business decision-making. Lucas and Drew share their backgrounds in data analytics and how their experiences have shaped their approa…
…
continue reading
1
From Bits to Tables: The Evolution of S3 Storage
50:08
50:08
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
50:08Summary In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative functionalities of S3 Tables and Vectors and their integration into modern data stacks. Andy shares his journey through the tech industry and his role at Amazon, where he collaborates to enhance storage capabilities, discussing the evolution of S3 from …
…
continue reading
1
Revolutionizing Python Notebooks with Marimo
51:56
51:56
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
51:56Summary In this episode of the Data Engineering Podcast Akshay Agrawal from Marimo discusses the innovative new Python notebook environment, which offers a reactive execution model, full Python integration, and built-in UI elements to enhance the interactive computing experience. He discusses the challenges of traditional Jupyter notebooks, such as…
…
continue reading
1
Warehouse Native Incremental Data Processing With Dynamic Tables And Delayed View Semantics
55:07
55:07
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
55:07Summary In this episode of the Data Engineering Podcast Dan Sotolongo from Snowflake talks about the complexities of incremental data processing in warehouse environments. Dan discusses the challenges of handling continuously evolving datasets and the importance of incremental data processing for optimized resource use and reduced latency. He expla…
…
continue reading
1
Streamlining Data Pipelines with MCP Servers and Vector Engines
52:04
52:04
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
52:04Summary In this episode of the Data Engineering Podcast Kacper Łukawski from Qdrant about integrating MCP servers with vector databases to process unstructured data. Kacper shares his experience in data engineering, from building big data pipelines in the automotive industry to leveraging large language models (LLMs) for transforming unstructured d…
…
continue reading
1
Foundational Data Engineering At Two Sigma
55:05
55:05
Nghe Sau
Nghe Sau
Danh sách
Thích
Đã thích
55:05Summary In this episode of the Data Engineering Podcast Effie Baram, a leader in foundational data engineering at Two Sigma, talks about the complexities and innovations in data engineering within the finance sector. She discusses the critical role of data at Two Sigma, balancing data quality with delivery speed, and the socio-technical challenges …
…
continue reading