show episodes
 
Data leaders powering data-driven innovation. In each episode, we salute Champions of Data + AI, the change agents who are shaking up the status quo. These mavericks are rethinking how data and AI can enhance the human experience. We’ll dive into their challenges — and celebrate their successes — all while getting to know these leaders a little more personally.
 
Welcome to Data Brew by Databricks with Denny and Brooke! In this series, we explore various topics in the data and AI community and interview subject matter experts in data engineering/data science. So join us with your morning brew in hand and get ready to dive deep into data + AI! For this first season, we will be focusing on lakehouses – combining the key features of data warehouses, such as ACID transactions, with the scalability of data lakes, directly against low-cost object stores.
 
Welcome to the Building the Backend Podcast! We’re a data podcast focused on uncovering the data technologies, processes, and patterns that are driving today’s most successful companies. You will hear from data leaders sharing their knowledge and insights with what’s working and what’s not working for them. Our goal is to bring you valuable insights that will save you and your team time when building a modern data architecture in the cloud. Topics will span from big data, AI, ML, governance, ...
 
New to networks? Looking into links? Realising the relevance of relationships? Welcome to GraphStuff, your one-stop podcast for all things connected. Join your hosts William Lyon and Lju Lazarevic and guests as they dive into the world of graph databases. They’ll cover everything from how they’re constructed and where they’re used; deep dives to modelling, from first concepts to finished application, from graph-shaped problems, to best-practice graphs. With something new and different each w ...
 
Loading …
show series
 
In this episode of Building The Backend we hear from Dipti Borkar cofounder @ Ahana a managed service for Presto on AWS, where we talk all about the data lake, how it should be structured and where the industry is going. Below are top 3 value bombs: Presto is an open source distributed SQL query engine originally created by Facebook, mainly used to…
 
We had some technical difficulties with Matt getting on the podcast so, Ray had to fly solo. This month we continue our investigations into K8s storage with a discussion with Tad Lebeck (@TadLebeck) US CTO, ionir, a software defined storage system that only runs under K8s. ionir Kubernetes Data Services platform is an outgrowth of Reduxio a “tin-wr…
 
What tools are you using for data viz? Are they low cost? One option is Apache Superset, in this episode we speak with Robert Stolz to learn more about Superset and other open source data tools. Top 3 Value Bombs: One popular use case with Apache Superset is embedding it within applications because it’s open source, there is a wide range of flexibi…
 
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. For our season 3 finale, Nithya Ruff discusses the open-source ecosystem, ways to contribute to open-source projects (hint: …
 
Organizations worldwide are focusing on greater diversity, equality and inclusion (DEI) in the workplace. But what about ensuring DEI within data? In this episode, Jeffrey Reid from Regeneron joins us to explore the struggles facing the genetics and healthcare industry when it comes to representation in data and the adverse impact it can have on AI…
 
In this episode of Building The Backend we hear from Simon Crosby – CTO @ Swim an open source edge computing operating system, where we talk all about edge computing, event streaming and much more. Below are top 3 value bombs: Edge means more than just being physically located somewhere it could also mean in the cloud. It really is the closest poin…
 
This episode is a little different then the usual format. Instead of interviewing a data leader - I share what I consider are the 12 most important principles when designing a modern data architecture. Please message me on LinkedIn with the thoughts on this show.By Travis Lawrence
 
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. We interview Junta Nakai in our most unique location yet - Brooklyn Kura - the first non-Japanese sake distillery in New Yor…
 
In this episode of Building The Backend we hear from Prukalpa Sankar – Co-founder of Atlan, where we talk all about data quality/governance, common issues organizations face when implementing data quality and much much more. Below are top 3 value bombs: Data Governance has a bad reputation. It should not be a bureaucratic controlling process that i…
 
Stateful containers are becoming a hot topic these days so we thought it a good time to talk to the CNCF (Cloud Native Computing Foundation) Rook team about what they are doing to make storage easier to use for k8s container apps. CNCF put us into contact with Sébastien Han (@leseb_), Ceph Storage Architect and Travis Nielsen (@STravisNielsen), bot…
 
Try the new GraphAcademy today: https://graphacademy.neo4j.com/ Introducing the New GraphAcademy blog post: https://medium.com/neo4j/introducing-the-new-graphacademy-45b0df491a23 Adam's "Improving the Neo4j Developer Experience with Neo4j" talk at NODES 2021: https://www.youtube.com/watch?v=D4dTBzZ4uC8&list=PL9Hl4pk2FsvXfH-q5aghB2g7AlIztqoaf&index=…
 
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. Did you know that the average tenure of a board member is longer than the average tenure of a marriage in the United States?…
 
Neo4j Sandbox: https://neo4j.com/sandbox/ OpenStreetMap Neo4j Sandbox: https://sandbox.neo4j.com/?usecase=openstreetmap Neo4j Graph Examples: https://github.com/neo4j-graph-examples Google dataset search: https://datasetsearch.research.google.com/ Meetup API: https://www.meetup.com/meetup_api/ Working with the Meetup API & Neo4j: https://github.com…
 
This is a podcast episode you do not want to miss with Stephen Brobst, CTO @ Teradata. We discuss all things Data Warehouses, the shift to the distributed cloud and, key principles to implementing successful DW's. Top 3 Value Bombs: Large organizations are shifting more to a distributed / inter-cloud architecture for many reasons, a couple of reaso…
 
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. What does it mean to make your machine learning system “production-ready”? Yaron Singer walks us through the infrastructure,…
 
Driving any type of change across a large enterprise is hard enough. Now imagine having to transform your organization to be data- and AI-driven so you can improve operational efficiency, accelerate innovation to gain new insights and implement best practices to build data products. John Irvin joins us to discuss this very transformation and the ro…
 
“The hardest part of ETL is not building the connectors, it is maintaining them.” Truer words never spoken. Really enjoyed this episode with Michel Tricot CEO & Co-Founder of Airbyte where we discuss all things data integration and connectors. Top 3 value bombs: The future of ETL/ELT integration connectors may lie with open source. Many closed sour…
 
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. Have you ever had a spam call automatically blocked for you? You can thank First Orion for that - in one day they blocked or…
 
This episode features Gleb Mezhanskiy Co-Founder & CEO @ Datafold, during our discussion we talk all about data observability and how to improve your data quality. Before Datafold, Gleb was a founding member of data teams at Lyft and Autodesk, where he built sophisticated data platforms and developed tooling to improve productivity and data quality…
 
For our third season, we focus on how leaders use data for change. Whether it’s building data teams or using data as a constructive catalyst, we interview subject matter experts from industry to dive deeper into these topics. In this season opener, Elena Donio shares her experience using data and domain knowledge to disrupt the traditional service …
 
Data is huge. But no matter how much data you have, it’s dwarfed by the data collected by your partners and third parties. That’s why data sharing is the next big thing. As organizations increase the use of third-party data to complement and supplement their existing data sets, the ability to securely share reliable, relevant and large quantities o…
 
The GreyBeards move up the stack this month with a talk on big data and data analytics with Sean Owen (@sean_r_owen), Data Science lead at Databricks and Apache Spark committee and PMC member. The focus of the talk was on Apache Spark. Spark is an Apache Software Foundation open-source data analytics project and has been up and running since 2010. …
 
This episode features Arjun Narayan Co-Founder & CEO @ Materialize, during our discussion we talk all about transforming streaming data, the do’s the don’ts and how Materialize is changing the landscape of streaming. Top 3 Value Bombs: When creating schema changes organizations should always strive to create forward compatible schema changes only. …
 
This episode features Jean-Yves Stephan Co-Founder & CEO @ Data Mechanics (recently Acq. by Spot by NetApp), during our discussion we talk about optimizing Spark to run in the cloud at a low cost. Top 3 Value Bombs: Running Spark CAN be expensive but there are ways to reduce your current operating costs by 50-75% by smart automations (i.e. tune for…
 
This episode features Josh Benamrum, who is the co-founder of Databand. Databand is a company that helps engineering teams achieve better observability and control over their tech stack. Top 3 Value Bombs: When observing our data we should be looking at our data and pipelines Don’t wait till the board meeting for an incorrect metric to make DQ a pr…
 
Travis welcomes to his podcast Saket Saurabh, who provides a window into the world of data management and the self-service options that are democratizing it. Co-founder and CEO of Nexla, Saket has a passion for data and infrastructure and how to improve its flow among partners, customers and vendors. Nexla automates various data engineering tasks, …
 
The GreyBeards had a great discussion with Floyd Christofferson, CEO, StrongBox Data Solutions on their big data/HPC file and archive solution. Floyd’s is very knowledgeable on problems of extremely large data repositories and has been around the HPC and other data intensive industries for decades. StrongBox’s StrongLink solution offers a global na…
 
In this episode, we speak with Rob Hedgpeth, a director of developer developer relations at Maria DB. We explore all things Maria DB, the capabilities it has and when you should consider it for your next project. Top 3 value bombs: MariaDB follows a shared nothing architecture and supports distributed SQL for unlimited scaling on demand. MariaDB ca…
 
In this episode, we speak with Lior Gavish, the co-founder of Monte Carlo to explore all things data quality. Monte Carlo is a data lineage and observability tool that lowers your data downtime. Top 3 Value Bombs: Data products should be thought of in it’s entirely from the source to the consumer. No one data stakeholder can solve data quality issu…
 
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. We branch, version, and test our code, but what if we treated data lik…
 
Healthcare is increasingly turning to data to improve the services it provides to its customers. Slawek shares his personal views on the role of data and AI in healthcare equity, particularly how data architectures like lakehouse can enable key insights while still protecting patient information. Finally, he’ll talk about how leadership coaching he…
 
In this episode, we speak with DeVaris Brown, he is the CEO and co-founder of Meroxa, which is a data platform that enables organizations to build real time data pipelines in minutes not months. Prior to founding Meroxa, DeVaris was a product leader at Twitter, Heroku, and Zendesk. In this episode we will be talking about all things data ingestion.…
 
GreyBeards had an amazing discussion with Peter Thompson (@Lucid_Link), CEO & co-founder and George Dochev (@GDochev), CTO & co-founder of LucidLink. Both Peter and George were very knowledgeable and easy to talk with. LucidLink’s Cloud NAS creates a NAS storage system out of cloud (any S3 compatible AND Azure Blob) object storage. LucidLink is mad…
 
In this episode, we speak with Blake Burch, co-founder of Shipyard, a data orchestrator tool that allows you to create powerful workflows in a matter of minutes. Top 3 Value Bombs: Data tests are often for the assumptions we already know. There's a lot of unknowns that can crop up and cause issues that tests are not catching. Start analyzing job me…
 
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. Is there ever a “one-size fits all” approach for feature engineering? …
 
NODES 2021 Keynote video: https://www.youtube.com/watch?v=4ZCs83_iHU8 Neo4j 4.3 blog post: https://neo4j.com/blog/introducing-neo4j-4-3-the-fastest-path-to-graph-productivity/ What New in Neo4j 4.3 video: https://www.youtube.com/watch?v=9klz_c7MPJ4 Neo4j 4.3 release notes: https://neo4j.com/release-notes/neo4j-4-3-0/ Neo4j 4.3 Blog Series: Relation…
 
For our second season of Data Brew, we will be focusing on machine learning, from research to production. We will interview folks in academia and industry to discuss topics such as data ethics, production-grade infrastructure for ML, hyperparameter tuning, AutoML, and many more. What does it mean for a model to be “interpretable”? Ameet Talwalkar s…
 
Neo4j Connectors page: neo4j.com/product/connectors Neo4j Connector for Apacha Kafka: neo4j.com/labs/kafka/ Neo4j Connector for Business Intelligence: neo4j.com/bi-connector/ Neo4j Connector for Apache Spark: neo4j.com/product/connectors/apache-spark-connector The Neo4j GraphQL library: neo4j.com/product/graphql-library/ Using Retool with Neo4j Gra…
 
In this episode, we speak with Mark Cusack, CTO at Yellowbrick. Yellowbrick is a data warehouse platform that was built from the ground up for performance and cost that can be deployed across clouds and on-prem. Top 3 Value Bombs: Yellowbrick DW was recently named a contender in Cloud Data Warehouses by Forrester Research and they are able to achie…
 
Loading …

Quick Reference Guide

Copyright 2021 | Sitemap | Privacy Policy | Terms of Service
Google login Twitter login Classic login