Data Engineering Podcast
1) data.world with Bryon Jacob - Episode 9
Summary We have tools and platforms for collaborating on software projects and linking them together, wouldn’t it be nice to have the same capabilities for data? The team at data.world are work...Show More
2) Branches, Diffs, and SQL: How Dolt Powers Agentic Workflows
Summary In this episode Tim Sehn, founder and CEO of DoltHub, talks about Dolt - the world’s first version‑controlled SQL database - and why Git‑style semantics belong at the heart of data system...Show More
3) Logical First, Physical Second: A Pragmatic Path to Trusted Data
Summary In this episode of the Data Engineering Podcast Jamie Knowles, Product Director for ER/Studio, talks about data architecture and its importance in driving business meaning. He discusses h...Show More
4) Your Data, Your Lake: How Observe Uses Iceberg and Streaming ETL for Observability
Summary In this episode Jacob Leverich, cofounder and CTO of Observe, talks about applying lakehouse architectures to observability workloads. Jacob discusses Observe’s decision to leverage cloud...Show More
5) Semantic Operators Meet Dataframes: Building Context for Agents with FENIC
Summary In this episode Kostas Pardalis talks about Fenic - an open-source, PySpark-inspired dataframe engine designed to bring LLM-powered semantics into reliable data engineering workflows. Kos...Show More
6) Beyond Dashboards: How Data Teams Earn a Seat at the Table
Summary In this episode Goutham Budati about his Data–Perspective–Action framework and how it empowers data teams to become true business partners. Gautham traces his path from automating Excel r...Show More
7) Unfreezing The Data Lake: The Future-Proof File Format
Summary In this episode PhD researcher Xinyu Zheng talks about F3, the “future-proof file format” designed to address today’s hardware realities and evolving workloads. He digs into the limitatio...Show More
8) From Context to Semantics: How Metadata Powers Agentic AI
Summary In this episode Suresh Srinivas and Sriharsha Chintalapani explore how metadata platforms are evolving from human-centric catalogs into the foundational context layer for AI and agentic s...Show More
9) From Data Engineering to AI Engineering: Where the Lines Blur
Summary In this solo episode of the Data Engineering Podcast, host Tobias Macey reflects on how AI has transformed the practice and pace of data engineering over time. Starting from its origins i...Show More
10) Malloy: Hierarchical Data, Semantic Models, and the Future of Analytics
Summary In this episode Michael Toy, co-creator of Malloy, talks about rethinking how we work with data beyond SQL. Michael shares the origins of Malloy from his and Lloyd Tabb’s experience at Lo...Show More