show episodes
 
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
  continue reading
 
Loading …
show series
 
Highlights from this week’s conversation include: Ryan’s background in data (0:58) Transition from Performing Arts to Data (2:23) Understanding End Users in Data Projects (6:08) Learning from Failures in Data Projects (8:07) The self-service era (19:50) Struggles of self-service (21:23) The disillusion with dashboards (26:23) GoodData's approach (3…
  continue reading
 
Highlights from this week’s conversation include: The Evolution of Data Processing (2:36) Ryan’s Background and Journey in Data (4:52) Challenges in Transitioning to S3 (8:47) Impact of Latency on Query Performance (11:43) Challenges with Table Representation (15:26) Designing a New Metadata Format (21:36) Integration with Existing Tools and Open S…
  continue reading
 
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
  continue reading
 
Highlights from this week’s conversation include: Apruva’s background in streaming technology (0:48) Developer experience and Kafka streams (2:47) Motivation to bootstrap a startup (4:09) Meeting the Confluent founders and early work at Confluent (6:59) Projects at Confluent and transition to engineering management (10:34) Overview of Responsive an…
  continue reading
 
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
  continue reading
 
Highlights from this week’s conversation include: Chad’s background and journey in data (0:46) Importance of Data Supply Chain (2:19) Challenges with Modern Data Stack (3:28) Comparing Data Supply Chain to Real-world Supply Chains (4:49) Overview of Gable.ai (8:05) Rethinking Data Catalogs (11:42) New Ideas for Managing Data (15:16) Data Discovery …
  continue reading
 
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
  continue reading
 
Highlights from this week’s conversation include: Kevin’s background and work at Stripe (0:31) Evolution of Data Infrastructure at Stripe (2:18) Kevin's Interest in Data (5:29) Software Engineer or Data Engineer? (8:27) Speech Recognition Work at Amazon (11:06) Efficiency and Cost Management (15:50) Metadata and Query Analysis (18:38) Surprising Di…
  continue reading
 
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
  continue reading
 
Highlights from this week’s conversation include: Michael’s background and journey in data (0:33) The origin story of Druid (2:39) Experiences and growth in Data (8:08) Druid's evolution (21:46) Druid's architectural decisions (26:32) The user experience (30:06) The developer experience (35:14) The evolution of BI tools (40:55) Data architecture an…
  continue reading
 
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps …
  continue reading
 
Highlights from this week’s conversation include: The evolution of data operations (1:13) Unravel's role in simplifying data operations (2:17) Kunal’s journey from fashion to enterprise data management (5:23)\ The Unravel platform and its components (10:08) Challenges in data operations at scale (16:34) Users of Unravel within an organization (22:3…
  continue reading
 
Highlights from this week’s conversation include: Tony's background and research focus (3:35) Challenges in academia and industry (6:15) Ph.D. student's routine (10:47) Academic paper review process (15:26) Aha moments in research (20:05) Academic lab structure (23:09) The decision to move from hardware to data research (24:43) Research focus on ti…
  continue reading
 
Highlights from this week’s conversation include: Peter's background and journey in data (0:26) Introduction to PLG (4:18) Starting in data at Heroku (6:05) Building the data stack at Heroku (8:13) Data stack requirements for early-stage companies (12:00) Differentiating PLG companies from open source companies (19:26) Venture capital and open sour…
  continue reading
 
Highlights from this week’s conversation include: The overview of refuel (0:33) The evolution of AI and LLMs (3:51) Types of LLM models (12:31) Implementing LLM use cases and cost considerations (00:15:52) User experience and fine-tuning LLM models (21:49) Categorizing search queries (22:44) Creating internal benchmark framework (29:50) Benchmarkin…
  continue reading
 
Highlights from this week’s conversation include: Viren’s background in data (0:39) Evolution of Orchestration (1:52) AI Orchestration (3:00) Understanding Conductor and orkes (6:26) Event-Driven Orchestration (8:10) Viren’s Transition to Founder (12:27) Non-Technical Aspects of Being a Founder (15:50) Democratizing AI for Developers (18:16) The ev…
  continue reading
 
Highlights from this week’s conversation include: Introduction of the panel (0:05) Defining composable data stack (5:22) Components of a composable data stack (7:49) Challenges and incentives for composable components (10:37) Specialization and modularity in data workloads (13:05) Organic evolution of composable systems (17:50) Efficiency and commo…
  continue reading
 
In this bonus episode, Eric and Kostas preview their upcoming discussion with a panel of experts as Wes McKinney (Co-Founder, Voltron), Pedro Pedreira Software Engineer, Meta), Chris Riccomini (Seed Investor, various startups), and Ryan Blue (Co-Founder and CEO, Tabular) join the show.By Rudderstack
  continue reading
 
Highlights from this week’s conversation include: Artyom’s background in the data space (0:32) The growth and changes at Cube (5:58) Pain points of managing metrics definitions across different tools (9:39) Trade-offs between coupled and decoupled semantic layers (12:12) Making a case for implementing a semantic layer (14:17) The evolution of seman…
  continue reading
 
Highlights from this week’s conversation include: No Code Analytics (1:22) Analytics as a Team Sport (2:31) The workflow of someone without Alteryx (11:27) Alteryx's ability to handle diverse data sources (14:32) The balance between ease of use and complexity (23:06) Enabling casual end users with a no code interface (24:19) Taking analytics to the…
  continue reading
 
Highlights from this week’s conversation include: Matt’s background and journey with Fermyon (2:32) WebAssembly and enhanced security models (3:43) The IOT Startup and Google Acquisition (10:49) Google's Early Containers (11:50) Scaling and anticipating requests (20:22) Introduction to WebAssembly and its importance (23:32) The Benefits of WebAssem…
  continue reading
 
Highlights from this week’s conversation include: The role of an orchestrator in the lifecycle of data (1:34) Relevance of orchestration in data pipelines (00:02:45) Changes around data ops and MLOps (3:37) Data Cleaning (11:42) Overview of Dagster (13:50) Assets vs Tasks in Data Pipeline (19:15) Building a Data Pipeline with Dexter (25:40) Differe…
  continue reading
 
Highlights from this week’s conversation include: The evolution of the data scientist role (1:03) Common problems in different companies (2:05) Measuring and curating content on Reddit (4:29) The challenges of working with unstructured content at Reddit and Twitter (11:03) Lessons learned from Reddit and applying them at Twitter (13:17) Data challe…
  continue reading
 
Highlights from this week’s conversation include: The Evolution of Databases and Data Systems (2:33) Abstracting Data for Business Users (4:31) Building a Database for Google-like Search (7:58) The Big Data Explosion (11:10) Selling Myspace as First Customer (13:14) Starting ActionIQ (16:57) The customer-centric organization (22:46) Transitioning t…
  continue reading
 
Highlights from this week’s conversation include: Defining data mesh (6:37) Addressing the scale of organizational complexity and usage (9:04) The shift from monolithic to microservices (12:24) The sociological structure in data mesh (13:59) Data product generation and sharing in data mesh (17:27) Data Mesh: Simplifying Data Work (24:09) Getting St…
  continue reading
 
Highlights from this week’s conversation include: Ben’s background in real estate (3:27) Why Fundrise was Started (4:37) Democratizing Investment Opportunities (6:35) Investment Thesis for Venture (11:55) Challenges with Data and Technology (12:34) Importance of Data Model Abstraction (20:03) Data Infrastructure and Investments (23:22) Evolution of…
  continue reading
 
Highlights from this week’s conversation include: The concept of composable at a lower level of data infrastructure (1:28) New architectures and components that allow developers to build databases (3:44) Pedro's background and experience in data infrastructure (6:18) The Spectrum of Latency and Analytics (12:59) Different Query Engines for Differen…
  continue reading
 
Highlights from this week’s conversation include: Colin's Background and Starting Omni (1:48) Defining “good” at Google search early in his career (4:42) Looker's Unique Approach to Analytics (9:48) The paradigm shift in analytics (10:52) The architecture of Looker and its influence (12:04) Combatting the challenge of unbundling in the data stack (…
  continue reading
 
Highlights from this week’s conversation include: The Unique Perspective of Practitioners (2:10) Account-based Marketing (6:30) Sales Development Representatives (SDR) (8:05) Descriptive, People, and Engagement Data (11:38) Data Overload and Actionable Data (14:20) Working with Data Teams and Internal Data (17:52) The relationship between business …
  continue reading
 
Highlights from this week’s conversation include: Johnny and David’s background in working together (1:56) The background story of Estuary (4:15) The challenges of ad tech and the need for low latency (5:44) Use cases for moving data at scale (10:35) Real-time data replication methods (11:54) Challenges with Kafka and the birth of Gazette (13:54) C…
  continue reading
 
Highlights from this week’s conversation include: The potential of AI-driven applications (1:34) The need for hardware infrastructure in AI experimentation (2:40) Oligopoly on the closed side (11:50) Advantages of private side vs. open source (13:18) Leveraging valuable data within enterprises (16:00) The urgency of adopting LLMs in the enterprise …
  continue reading
 
Highlights from this week’s conversation include: Chang’s background and journey with Pandas (6:26) The persisting challenges in data collection and preparation (10:37) The resistance to change in using Python for data workflows (13:05) AI hype and its impact (14:09) The success and evolution of Pandas as a data framework (20:04) The vision for a n…
  continue reading
 
Loading …

Quick Reference Guide