data.world public
[search 0]
More
Download the App!
show episodes
 
Catalog and Cocktails is an honest, no-BS, non-sales-y conversation about data and analytics. This is your unfiltered chat about everything interesting in data and metadata management, DataOps, architecture, and beyond. Join Juan Sequeda and Tim Gasper to explore emerging topics and hear from visionary leaders across the data space.
  continue reading
 
Tristan Handy has been curating the Analytics Engineering Roundup newsletter since 2015, pulling together the internet’s best data science & analytics articles. Tristan and co-host Julia Schottenstein now bring the Roundup to real life, hosting biweekly conversations with data practitioners inventing the future of analytics engineering. You can view full episode summaries and read back issues of the Roundup newsletter at https://roundup.getdbt.com. The podcast is sponsored by dbt labs, maker ...
  continue reading
 
Loading …
show series
 
Eric Avidon is a journalist at TechTarget who's interviewed Tristan a few times, and now Tristan gets to flip the script and interview Eric. Eric is a journalist veteran, covering everything from finance to the Boston Red Sox, but now he spends a lot of time with vendors in the data space and has a broad view of what's going on. Eric and Tristan di…
  continue reading
 
Knowledge Graphs are gaining more and more attention due to their role of structuring data and knowledge and providing accuracy for LLMs through GraphRAG. In this episode, we are joined by Ora Lassila, one of the “fathers” of RDF graphs and semantic web, which are the foundations for modern knowledge graphs, where we will dive into the questions yo…
  continue reading
 
Knowledge Graphs are gaining more and more attention due to their role of structuring data and knowledge and providing accuracy for LLMs through GraphRAG. In this episode, we are joined by Ora Lassila, one of the “fathers” of RDF graphs and semantic web, which are the foundations for modern knowledge graphs, where we will dive into the questions yo…
  continue reading
 
CDO, CDAO, CAIO, CIO, CTO! Oh my, it's a cluster! Sol Rashidi joins Tim and Juan to help navigate this cluster, sharing honest no bs advice from her vast experience in the Data and AI world. If you are a leader, or a practioner aspiring to go to leadership, this is the must listen episode!By data.world
  continue reading
 
CDO, CDAO, CAIO, CIO, CTO! Oh my, it's a cluster! Sol Rashidi joins Tim and Juan to help navigate this cluster, sharing honest no bs advice from her vast experience in the Data and AI world. If you are a leader, or a practioner aspiring to go to leadership, this is the must listen episode!By data.world
  continue reading
 
Barry McCardel is the co-founder and CEO of Hex. Hex is an analytics tool that's structured around a notebook experience, but as you'll hear in the episode, goes well beyond the traditional notebook. We're big fans of Hex at dbt Labs, and use it for a bunch of our internal data work. In this episode, Barry and Tristan discuss notebooks and data ana…
  continue reading
 
Matt Turck has been publishing his ecosystem map since 2012. It was first called the Big Data Landscape. Now it’s the Machine Learning, AI & Data (MAD) Landscape. The 2024 MAD Landscape includes 2,011(!) logos, which Matt attributes first a data infrastructure cycle and now an ML/AI cycle. As Matt writes, “Those two waves are intimately related. A …
  continue reading
 
Matthew Lynley is a bit of a hybrid. He's been a long-time journalist covering enterprise tech, currently in his fantastic AI and data newsletter Supervised, and he's also been a hands-on data practitioner. Matthew has covered the analytics tech stack, but this time Tristan turns the tables to get Matthew’s perspective on the rise of Gen AI as a to…
  continue reading
 
Juan Sequeda is a principal data scientist and head of the AI Lab at data.world, and is also the co-host of the fantastic data podcast Catalog and Cocktails. This episode tackles semantics, semantic web, Juan’s research in how raw text-to-SQL performs versus text-to-semantic layer, and where we both believe AI will make an impact in the world of st…
  continue reading
 
You can’t lift and shift your traditional data governance practices to AI governance. AI's unique quirks bring unique governance requirements: understanding its limitations, ensuring fairness, protecting personal and intellectual property rights, and tailoring accuracy to specific use cases.By data.world
  continue reading
 
Benn Stancil, cofounder and CTO at Mode, returns to The Analytics Engineering Podcast to discuss the evolution of the term "modern data stack" and its value today. Tristan wrote on this idea for The Analytics Engineering Roundup in Is the Modern Data Stack Still a Useful Idea? For full show notes and to read 6+ years of back issues of the podcast's…
  continue reading
 
Jeremiah Owyang is a general partner at Blitzscaling Ventures. His career arc has spanned web, sharing economy, and autonomous/AI technologies. He believes that AI is going to help humanity accomplish many of the big challenges we have for society, from health to learning to work and more. But the way we communicate and measure work will change - r…
  continue reading
 
Moritz Heimpel from Siemens and Ben Flusberg from Cox Automotive have very similar jobs. They both act as stewards of the data strategies at large, complex companies. In this episode, we get into what it’s like to collaborate with data at scale. Ben and Mortitz share their experiences adopting a data mesh architecture and what that looks like at th…
  continue reading
 
Data contracts are all the rage and it’s about shifting responsibility to the left. What does that actually mean? How do you do that? Do we actually need more technology for it? Who needs to be involved? So many honest no-bs questions and who best to answer them than Andrew Jones, inventor of data contracts.…
  continue reading
 
Investing in Knowledge Graph provides higher accuracy for LLM-powered question-answering systems. That's the conclusion of the latest research that Juan Sequeda, Dean Allemang and Bryon Jacob have recently presented. In this episode, we will dive into the details of this research and understand why to succeed in this AI world, enterprises must trea…
  continue reading
 
If Data Vault is a new term for you, it’s a data modeling design pattern. We’re joined by Brandon Taylor, a senior data architect at Guild, and Michael Olschimke, who is the CEO of Scalefree—the consulting firm whose co-founder Dan Lindstedt is credited as the designer of the data vault architecture. In this conversation with Tristan and Julia, Mic…
  continue reading
 
Jonathan Frankle is the Chief Scientist at MosaicML, which was recently bought by Databricks for $1.3 billion. MosaicML helps customers train generative AI models on their data. Lots of companies are excited about gen AI, and the hope is that their company data and information will be what sets them apart from the competition. In this conversation …
  continue reading
 
RAG, Retrieval Augemented Generation, is the term you now constantly hear in conjunction with LLM that provides context. But how does it actually work? And what's the relationship with Vector Databases and Knowledge Graphs? This will be a geeky AI episode with Mike Dillinger.By data.world
  continue reading
 
Technical folks miss the boat and are boring when they talk about the features of data catalog such as glossaries and data lineage to business people. In this episode Krystin Kim will share how a data catalog should be presented to the business: the ultimate place to share ideas across big companies, a treasure trove of use cases for others to disc…
  continue reading
 
In this conversation with Tristan recorded at Coalesce 2023, Kasey Mazza, an analytics engineering manager on the RevOps team at HubSpot, discusses the roles of data analysts and analytics engineers, the importance of building internal data communities, and the evolving landscape of data teams. Watch Kasey’s Coalescse 2023 presentation The career g…
  continue reading
 
What's the state of data stewardship today and where is it going? Will data stewards continue to exist? How is this evolving with respect to data products? And what is the impact of AI? All of these questions and more is what Tim and Juan ranted about in this episode.By data.world
  continue reading
 
It turns out data plays a big role in getting cereal manufactured and delivered so you can enjoy your Cheerios reliably for breakfast. We talk with Arjun Narayan, CEO of Materialize, a company building an operational warehouse, and Nathan Bean, a data leader at General Mills responsible for all of the company's manufacturing analytics and insights.…
  continue reading
 
Takeaways from With the hype of Generative AI, how do we keep focused on the goal of delivering the right valuable and business facing analytics and data use-cases with as low friction and maximum agility as possible? Jon Cooke, founder of Dataception has a lot of honest no-bs thoughts to share!By data.world
  continue reading
 
Tim and Juan provide an honest no-bs summary of what they observed and learned at Big Data London and the AI Conference in San Francisco. Tune in to get the latest on Data and AI and short interviews from Peter Norvig (Google), Nazneen Rajani (Hugging Face), Fabiana Clemente (YData), Danny Bickson (Visual Layer), Gev Sogomonian (AimStack), Yujian T…
  continue reading
 
Yannick Misteli is the head of engineering for the go-to-market domain at Roche, a $250 billion multinational pharmaceutical and diagnostics company. Roche was an early supporter of dbt Cloud, and Yannick helped move his team of 120+ engineers to a modern data stack. He always finds a way to push the boundaries to make a large company founded in 18…
  continue reading
 
Loading …

Quick Reference Guide