This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
…
continue reading
Are you on top of the latest innovations in data, analytics, and AI? With data being pivotal to strategy and change, the Data-powered Innovation Jam podcast gives you the key to some of the most crucial aspects of business success. Through our guests, we bring you the latest trends from the world of data and AI, discussing the best ideas and experiences. Our hosts with their decades of profound experience and a background in avant-garde music, will also explore the edges of jazz, rock, and p ...
…
continue reading
Little Fluffy PolyClouds: The Data Engineering Playbook is your essential guide to building cloud-agnostic data infrastructure. We provide practical, step-by-step strategies for designing and deploying resilient data systems across all major platforms, including AWS, Azure, and GCP.
…
continue reading
Hi, we’re Tim Berglund, Adi Polak, and Viktor Gamov and we’re excited to bring you the Confluent Developer podcast (formerly “Streaming Audio.”) Our hand-crafted weekly episodes feature in-depth interviews with our community of software developers (actual human beings - not AI) talking about some of the most interesting challenges they’ve faced in their careers. We aim to explore the conditions that gave rise to each person’s technical hurdles, as well as how their experiences transformed th ...
…
continue reading
Independent contractor software developer and cloud platform engineer. Podcast and music by Pilgrim Engineering Architecture Technology PEAT UK
…
continue reading
Hosted by Viktor Gamov and Kaitlyn Barnard, we interview software developers and technology leaders at the top of their game every other week. We’ll also give you the tools, tactics and strategies you need to take your cloud native architecture to the next level. We go beyond the buzzwords and dissect real-life applications and success stories so that you can tackle your biggest connectivity challenges.
…
continue reading
What does the future of AI sound like? In this special year-end episode of Data-powered Innovation Jam, we riff on seven bold predictions for 2026, from security-first AI and multi-agent ecosystems to industry-native intelligence and even synthetic curiosity. Join hosts Ron Tolido and Robert Engels as they jam with thought leaders on trends that wi…
…
continue reading
1
Decreasing Java Build Times with Pratik Patel | Ep. 10
25:56
25:56
Play later
Play later
Lists
Like
Liked
25:56Tim Berglund talks to Pratik Patel (Azul Systems) about his career in developer relations and Java. Pratik’s first job: computer lab assistant at UNC Chapel Hill. His challenge: working at a large enterprise with manual, slow build processes and transforming them through automation. SEASON 2 Hosted by Tim Berglund, Adi Polak and Viktor Gamov Produc…
…
continue reading
1
Blurring Lines: Data, AI, and the New Playbook for Team Velocity
1:00:57
1:00:57
Play later
Play later
Lists
Like
Liked
1:00:57Summary In this crossover episode, Max Beauchemin explores how multiplayer, multi‑agent engineering is transforming the way individuals and teams build data and AI systems. He digs into the shifting boundary between data and AI engineering, the rise of “context as code,” and how just‑in‑time retrieval via MCP and CLIs lets agents gather what they n…
…
continue reading
Bringing you a very special edition of the Data Powered Innovation Jam podcast, recorded live during a tech road trip through San Francisco and Silicon Valley. This episode blends the city’s musical heritage with cutting-edge innovation, exploring how creativity and technology intersect. Hosts Robert Engels, together with our ‘tech guy’ Alex Bulat …
…
continue reading
1
Reimagining Stream Processing with Matthias J. Sax | Ep. 9
36:42
36:42
Play later
Play later
Lists
Like
Liked
36:42Viktor Gamov talks to Matthias J. Sax (Confluent) about his career in stream processing and, specifically, Kafka Streams. Matthias’ first job: an electrician-in-training on BMW’s assembly lines. His challenge: building Kafka Streams at Confluent with a focus on API design, backward compatibility, and a library-first approach that also fits microser…
…
continue reading
1
State, Scale, and Signals: Rethinking Orchestration with Durable Execution
51:46
51:46
Play later
Play later
Lists
Like
Liked
51:46Summary In this episode Preeti Somal, EVP of Engineering at Temporal, talks about the durable execution model and how it reshapes the way teams build reliable, stateful systems for data and AI. She explores Temporal’s code‑first programming model—workflows, activities, task queues, and replay—and how it eliminates hand‑rolled retry, checkpoint, and…
…
continue reading
1
How Time Kills All Deals in Pre-Sales with Rachel Pedreschi | Ep. 8
27:40
27:40
Play later
Play later
Lists
Like
Liked
27:40Listen: https://confluent.buzzsprout.com | In this episode, Tim Berglund talks to his guest, Rachel Pedreschi (DeltaStream), about her career in pre-sales engineering. Her first job: rectory office assistant at her local parish. Her challenge/theme: working at early-stage startups to bridge sales, marketing, and engineering to reach product-market …
…
continue reading
1
The AI Data Paradox: High Trust in Models, Low Trust in Data
51:35
51:35
Play later
Play later
Lists
Like
Liked
51:35Summary In this episode of the Data Engineering Podcast Ariel Pohoryles, head of product marketing for Boomi's data management offerings, talks about a recent survey of 300 data leaders on how organizations are investing in data to scale AI. He shares a paradox uncovered in the research: while 77% of leaders trust the data feeding their AI systems,…
…
continue reading
1
Scaling AI in Engineering with Peter Bell | Ep. 7
27:16
27:16
Play later
Play later
Lists
Like
Liked
27:16Listen: https://confluent.buzzsprout.com | Today, Adi Polak talks to her guest, Peter Bell (gather.dev), about his career in software engineering leadership, CTO community building, and AI-driven development. Peter’s first job: electronics lab technician at their school (alongside shifts at Tesco). His challenge/theme: working at scale with AI adop…
…
continue reading
1
Bridging the AI–Data Gap: Collect, Curate, Serve
50:40
50:40
Play later
Play later
Lists
Like
Liked
50:40Summary In this episode of the Data Engineering Podcast Omri Lifshitz (CTO) and Ido Bronstein (CEO) of Upriver talk about the growing gap between AI's demand for high-quality data and organizations' current data practices. They discuss why AI accelerates both the supply and demand sides of data, highlighting that the bottleneck lies in the "middle …
…
continue reading
1
How Kafka Expert Robin Moffat Tackles Open Source Problems | Ep. 6
24:50
24:50
Play later
Play later
Lists
Like
Liked
24:50Today, Viktor Gamov talks to his colleague Robin Moffat (Confluent) about his career in data engineering. His first job: paperboy. His challenge: working at a retailer with Oracle materialized views as well as teaching others how to productively approach Kafka’s internal systems. Blog posts mentioned in the podcast: ► Oracle Materialized Views trou…
…
continue reading
1
Beyond the Perimeter: Practical Patterns for Fine‑Grained Data Access
1:05:00
1:05:00
Play later
Play later
Lists
Like
Liked
1:05:00Summary In this episode of the Data Engineering Podcast Matt Topper, president of UberEther, talks about the complex challenge of identity, credentials, and access control in modern data platforms. With the shift to composable ecosystems, integration burdens have exploded, fracturing governance and auditability across warehouses, lakes, files, vect…
…
continue reading
1
Episode 3: The Pipeline Pit Crew: Monitoring, Troubleshooting, and Optimizing Your AWS Data
12:36
12:36
Play later
Play later
Lists
Like
Liked
12:36Keep your data pipelines running smoothly! This episode covers Domain 3 (22% of the DEA-C01 exam). We dive into setting up alarms with CloudWatch, troubleshooting stuck jobs with Glue Logs, optimizing performance and cost in Redshift, and ensuring data quality with AWS Glue DataBrew.By James
…
continue reading
Where should you put your data? We tackle Domain 2 (26% of the DEA-C01 exam) by comparing Redshift, DynamoDB, and RDS. Learn how to design optimal schemas, use the AWS Glue Data Catalog, and implement S3 Lifecycle Policies to manage data lifespan and control costs.By James
…
continue reading
1
Episode 4: The Data Fortress: Securing and Governing Data for the DEA-C01
12:20
12:20
Play later
Play later
Lists
Like
Liked
12:20Lock down your data platform! This is the final domain, Domain 4 (18% of the DEA-C01 exam). We cover essential security best practices: using IAM and Lake Formation for access control, enforcing encryption with KMS (at rest and in transit), and securing network access via VPC and Security Groups.By James
…
continue reading
1
Episode 1: Mastering the AWS Data Assembly Line
18:05
18:05
Play later
Play later
Lists
Like
Liked
18:05This is the essential guide to Domain 1: Data Ingestion and Transformation—the biggest section (34%) of the AWS Certified Data Engineer - Associate (DEA-C01) exam! We break down the core components of a successful data pipeline. Learn to compare Batch vs. Streaming with services like Kinesis and DMS, master ETL/ELT using AWS Glue and EMR, and orche…
…
continue reading
In this genre-blending episode of Data Powered Innovation Jam, hosts Ron Tolido, Robert Engels, and Arne Rossman welcome Stephen Brobst, CTO of Ab Initio and former CTO of Terradata, for a deep dive into the art of mixing data, AI, and music. From punk rock roots and stage-diving legends to the reinvention of enterprise data platforms, Stephen shar…
…
continue reading
51
Building Parquet into Apache Pinot ft. Neha Pawar | Ep. 5
26:07
26:07
Play later
Play later
Lists
Like
Liked
26:07Today, Tim Berglund talks to Neha Pawar (StarTree) about her career in real-time analytics and open source database engineering. Her first job: a year-long internship at NVIDIA. Her challenge: leading the technical effort to add native Parquet support into Apache Pinot. SEASON 2 Hosted by Tim Berglund, Adi Polak and Viktor Gamov Produced and Edited…
…
continue reading
1
The True Costs of Legacy Systems: Technical Debt, Risk, and Exit Strategies
1:04:16
1:04:16
Play later
Play later
Lists
Like
Liked
1:04:16Summary In this episode Kate Shaw, Senior Product Manager for Data and SLIM at SnapLogic, talks about the hidden and compounding costs of maintaining legacy systems—and practical strategies for modernization. She unpacks how “legacy” is less about age and more about when a system becomes a risk: blocking innovation, consuming excess IT time, and cr…
…
continue reading
51
The Fix That Secured 1000s of Credit Cards ft. Brian Sletten | Ep. 4
29:37
29:37
Play later
Play later
Lists
Like
Liked
29:37In this episode, Tim talks to Brian Sletten (Bosatsu Consulting) about his career in software development. His first job: working at a small communications company that built network matrix switch interfaces. His challenge/theme: overhauling credit card storage and security at a major hospitality company. SEASON 2 Hosted by Tim Berglund, Adi Polak …
…
continue reading
1
Context Engineering as a Discipline: Building Governed AI Analytics
51:58
51:58
Play later
Play later
Lists
Like
Liked
51:58Summary In this episode of the Data Engineering Podcast, host Tobias Macey welcomes back Nick Schrock, CTO and founder of Dagster Labs, to discuss Compass - a Slack-native, agentic analytics system designed to keep data teams connected with business stakeholders. Nick shares his journey from initial skepticism to embracing agentic AI as model and a…
…
continue reading
Welcome to the latest episode of the Data Powered Innovation Jam, where data meets disco and AI grooves with funk. After a long summer break, our hosts return with fresh stories, musical nostalgia, and cutting-edge insights into the world of supply chain superintelligence. In this vibrant and eclectic episode, we’re joined by Guillaume Waline, Seni…
…
continue reading
1
How Viktor Gamov Stays Curious as Tech Rapidly Evolves | Ep. 3
30:11
30:11
Play later
Play later
Lists
Like
Liked
30:11Adi Polak interviews her co-host, Viktor Gamov, about his career’s evolution from distributed systems to streaming technology. Viktor’s first job: apple picking. His challenge/theme: staying curious and non-judgmental in the ever-changing landscape of tech. SEASON 2 Hosted by Tim Berglund, Adi Polak and Viktor Gamov Produced and Edited by Noelle Ga…
…
continue reading
1
The Data Model That Captures Your Business: Metric Trees Explained
1:01:05
1:01:05
Play later
Play later
Lists
Like
Liked
1:01:05Summary In this episode of the Data Engineering Podcast Vijay Subramanian, founder and CEO of Trace, talks about metric trees - a new approach to data modeling that directly captures a company's business model. Vijay shares insights from his decade-long experience building data practices at Rent the Runway and explains how the modern data stack has…
…
continue reading
1
How Tim Berglund Found His Calling | Ep. 2
30:36
30:36
Play later
Play later
Lists
Like
Liked
30:36Viktor Gamov interviews his co-host, Tim Berglund, about his career in the world of streaming data. Tim’s first job: Burger King broiler steamer. His challenge/theme: pivoting from working in hardware and firmware to finding his calling in enterprise software and developer relations. SEASON 2 Hosted by Tim Berglund, Adi Polak and Viktor Gamov Produ…
…
continue reading
1
From GPUs-as-a-Service to Workloads-as-a-Service: Flex AI’s Path to High-Utilization AI Infra
56:31
56:31
Play later
Play later
Lists
Like
Liked
56:31Summary In this crossover episode of the AI Engineering Podcast, host Tobias Macey interviews Brijesh Tripathi, CEO of Flex AI, about revolutionizing AI engineering by removing DevOps burdens through "workload as a service". Brijesh shares his expertise from leading AI/HPC architecture at Intel and deploying supercomputers like Aurora, highlighting…
…
continue reading
1
Building Real-time Systems for Apple, Nike & more ft. Adi Polak | Ep. 1
32:53
32:53
Play later
Play later
Lists
Like
Liked
32:53The Confluent Developer Podcast is here! For this first episode, Tim Berglund talks to his co-host, Adi Polak (Confluent), about her career in distributed data systems. Her first job: neighborhood dogwalker. Her challenge/theme: early Hadoop, working at Akamai on data optimization and real-time threat detection for huge global customers like Apple,…
…
continue reading
1
From RAG to Relational: How Agentic Patterns Are Reshaping Data Architecture
52:58
52:58
Play later
Play later
Lists
Like
Liked
52:58Summary In this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at AWS, talks about how agentic workflows are transforming database usage and infrastructure design. He discusses the evolving role of data in AI systems, from traditional models to more modern approaches like vectors, RAG, and relational databases. Ma…
…
continue reading
1
Duck Lake: Simplifying the Lakehouse Ecosystem
1:10:41
1:10:41
Play later
Play later
Lists
Like
Liked
1:10:41Summary In this episode of the Data Engineering Podcast Hannes Mühleisen and Mark Raasveldt, the creators of DuckDB, share their work on Duck Lake, a new entrant in the open lakehouse ecosystem. They discuss how Duck Lake, is focused on simplicity, flexibility, and offers a unified catalog and table format compared to other lakehouse formats like I…
…
continue reading
1
We're back! Welcome to the Confluent Developer Podcast.
1:20
1:20
Play later
Play later
Lists
Like
Liked
1:20Weekly episodes launching Sept. 22! | Hi, I'm Tim Berglund. It's been about four years since I've been podcasting at Confluent, and "Streaming Audio" has been on hiatus for a little more than two, but I've got great news: we are back! We're back with a new name, a new format, and new hosts. Welcome to the Confluent Developer Podcast, where we talk …
…
continue reading
1
Aligning Business and Data: The Essential Role of Data Modeling
1:06:51
1:06:51
Play later
Play later
Lists
Like
Liked
1:06:51Summary In this episode of the Data Engineering Podcast Serge Gershkovich, head of product at SQL DBM, talks about the socio-technical aspects of data modeling. Serge shares his background in data modeling and highlights its importance as a collaborative process between business stakeholders and data teams. He debunks common misconceptions that dat…
…
continue reading
1
From Academia to Industry: Bridging Data Engineering Challenges
50:54
50:54
Play later
Play later
Lists
Like
Liked
50:54Summary In this episode of the Data Engineering Podcast Professor Paul Groth, from the University of Amsterdam, talks about his research on knowledge graphs and data engineering. Paul shares his background in AI and data management, discussing the evolution of data provenance and lineage, as well as the challenges of data integration. He explores t…
…
continue reading
1
High Performance And Low Overhead Graphs With KuzuDB
1:01:29
1:01:29
Play later
Play later
Lists
Like
Liked
1:01:29Summary In this episode of the Data Engineering Podcast Prashanth Rao, an AI engineer at KuzuDB, talks about their embeddable graph database. Prashanth explains how KuzuDB addresses performance shortcomings in existing solutions through columnar storage and novel join algorithms. He discusses the usability and scalability of KuzuDB, emphasizing its…
…
continue reading
1
Bridging Data and Decision-Making: AI's Role in Modern Analytics
1:10:44
1:10:44
Play later
Play later
Lists
Like
Liked
1:10:44Summary In this episode of the Data Engineering Podcast Lucas Thelosen and Drew Gilson from Gravity talk about their development of Orion, an autonomous data analyst that bridges the gap between data availability and business decision-making. Lucas and Drew share their backgrounds in data analytics and how their experiences have shaped their approa…
…
continue reading
1
From Bits to Tables: The Evolution of S3 Storage
50:08
50:08
Play later
Play later
Lists
Like
Liked
50:08Summary In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative functionalities of S3 Tables and Vectors and their integration into modern data stacks. Andy shares his journey through the tech industry and his role at Amazon, where he collaborates to enhance storage capabilities, discussing the evolution of S3 from …
…
continue reading
1
Revolutionizing Python Notebooks with Marimo
51:56
51:56
Play later
Play later
Lists
Like
Liked
51:56Summary In this episode of the Data Engineering Podcast Akshay Agrawal from Marimo discusses the innovative new Python notebook environment, which offers a reactive execution model, full Python integration, and built-in UI elements to enhance the interactive computing experience. He discusses the challenges of traditional Jupyter notebooks, such as…
…
continue reading
1
Warehouse Native Incremental Data Processing With Dynamic Tables And Delayed View Semantics
55:07
55:07
Play later
Play later
Lists
Like
Liked
55:07Summary In this episode of the Data Engineering Podcast Dan Sotolongo from Snowflake talks about the complexities of incremental data processing in warehouse environments. Dan discusses the challenges of handling continuously evolving datasets and the importance of incremental data processing for optimized resource use and reduced latency. He expla…
…
continue reading
1
Streamlining Data Pipelines with MCP Servers and Vector Engines
52:04
52:04
Play later
Play later
Lists
Like
Liked
52:04Summary In this episode of the Data Engineering Podcast Kacper Łukawski from Qdrant about integrating MCP servers with vector databases to process unstructured data. Kacper shares his experience in data engineering, from building big data pipelines in the automotive industry to leveraging large language models (LLMs) for transforming unstructured d…
…
continue reading
1
Foundational Data Engineering At Two Sigma
55:05
55:05
Play later
Play later
Lists
Like
Liked
55:05Summary In this episode of the Data Engineering Podcast Effie Baram, a leader in foundational data engineering at Two Sigma, talks about the complexities and innovations in data engineering within the finance sector. She discusses the critical role of data at Two Sigma, balancing data quality with delivery speed, and the socio-technical challenges …
…
continue reading