222 subscribers
Go offline with the Player FM app!
Podcasts Worth a Listen
SPONSORED


1 Understanding Taxes as a Newly Formed Small Business - Part 2 of the Small Business Starter Kit 28:24
#239 Tuhin Srivatsa: How Baseten is Disrupting AI Deployment & Scaling in 2025
Manage episode 468574200 series 2455219
This episode is sponsored by Thuma.
Thuma is a modern design company that specializes in timeless home essentials that are mindfully made with premium materials and intentional details.
To get $100 towards your first bed purchase, go to http://thuma.co/eyeonai
—————————————————————————————————————————
AI deployment is broken—can it be fixed? In this episode, Tuhin Srivatsa, CEO & Co-Founder of Baseten, reveals how his company is DISRUPTING AI infrastructure, making it easier, faster, and more cost-effective to deploy and scale AI models in production.
As enterprises increasingly turn to open-source AI models and grapple with the high costs and complexity of scaling, Baseten offers a game-changing solution that eliminates bottlenecks and simplifies the process. Discover how Baseten is taking on AWS SageMaker, OpenAI, and cloud-based AI deployment platforms to reshape the future of AI model deployment.
What You’ll Learn in This Episode:Why AI deployment & scaling is one of the biggest challenges in 2025
How Baseten enables enterprises to run AI models faster & more efficiently
The shift from closed-source to open-source AI models—and why it matters
The hidden costs of AI inference & how to optimize for performance
Why most AI models fail in production and how to prevent it
The future of AI infrastructure: What comes next for scalable AI
Whether you’re a machine learning engineer, AI researcher, startup founder, or enterprise leader, this episode is packed with actionable insights to help you scale AI models without the headaches.
Don’t miss this conversation on the next era of AI deployment!
#AI #ArtificialIntelligence #MachineLearning #Baseten #AIDeployment #AIScaling #Inference #MLInfrastructure #TechPodcast
Stay Updated:
Craig Smith Twitter: https://twitter.com/craigss
Eye on A.I. Twitter: https://twitter.com/EyeOn_AI
—————————————————————————————————————————
(00:00) Tuhin Srivatsa’s Journey in AI & Baseten
(01:50) What is AI Infrastructure & Why It Matters
(03:30) How Baseten Optimizes AI Model Deployment
(05:19) Why Most AI Deployments Fail (And How to Fix It)
(09:17) The Future of Open-Source AI Models in Enterprise
(11:01) How Baseten Automates AI Scaling & Inference
(14:12) Why AI Developers Struggle with Cloud-Based AI Tools
(18:47) The Real Cost of AI Inference (And How to Reduce It)
(20:44) Why AI Scaling is the Biggest Challenge in 2025
(26:55) Can AI Run on Non-NVIDIA Chips? (The Hardware Debate)
(31:23) The Future of AI Model Deployment & Inference
(37:05) How AI Agents & Reasoning Models Are Changing the Game
(40:39) The Truth About AI Hype vs. Reality
(45:04) How to Get Started with Baseten
(45:48) The Future of AI Infrastructure
245 episodes
Manage episode 468574200 series 2455219
This episode is sponsored by Thuma.
Thuma is a modern design company that specializes in timeless home essentials that are mindfully made with premium materials and intentional details.
To get $100 towards your first bed purchase, go to http://thuma.co/eyeonai
—————————————————————————————————————————
AI deployment is broken—can it be fixed? In this episode, Tuhin Srivatsa, CEO & Co-Founder of Baseten, reveals how his company is DISRUPTING AI infrastructure, making it easier, faster, and more cost-effective to deploy and scale AI models in production.
As enterprises increasingly turn to open-source AI models and grapple with the high costs and complexity of scaling, Baseten offers a game-changing solution that eliminates bottlenecks and simplifies the process. Discover how Baseten is taking on AWS SageMaker, OpenAI, and cloud-based AI deployment platforms to reshape the future of AI model deployment.
What You’ll Learn in This Episode:Why AI deployment & scaling is one of the biggest challenges in 2025
How Baseten enables enterprises to run AI models faster & more efficiently
The shift from closed-source to open-source AI models—and why it matters
The hidden costs of AI inference & how to optimize for performance
Why most AI models fail in production and how to prevent it
The future of AI infrastructure: What comes next for scalable AI
Whether you’re a machine learning engineer, AI researcher, startup founder, or enterprise leader, this episode is packed with actionable insights to help you scale AI models without the headaches.
Don’t miss this conversation on the next era of AI deployment!
#AI #ArtificialIntelligence #MachineLearning #Baseten #AIDeployment #AIScaling #Inference #MLInfrastructure #TechPodcast
Stay Updated:
Craig Smith Twitter: https://twitter.com/craigss
Eye on A.I. Twitter: https://twitter.com/EyeOn_AI
—————————————————————————————————————————
(00:00) Tuhin Srivatsa’s Journey in AI & Baseten
(01:50) What is AI Infrastructure & Why It Matters
(03:30) How Baseten Optimizes AI Model Deployment
(05:19) Why Most AI Deployments Fail (And How to Fix It)
(09:17) The Future of Open-Source AI Models in Enterprise
(11:01) How Baseten Automates AI Scaling & Inference
(14:12) Why AI Developers Struggle with Cloud-Based AI Tools
(18:47) The Real Cost of AI Inference (And How to Reduce It)
(20:44) Why AI Scaling is the Biggest Challenge in 2025
(26:55) Can AI Run on Non-NVIDIA Chips? (The Hardware Debate)
(31:23) The Future of AI Model Deployment & Inference
(37:05) How AI Agents & Reasoning Models Are Changing the Game
(40:39) The Truth About AI Hype vs. Reality
(45:04) How to Get Started with Baseten
(45:48) The Future of AI Infrastructure
245 episodes
All episodes
×
1 #243 Greg Osuri: Why the Future of AI Depends on Decentralized Cloud Platforms 59:19

1 #242 Dylan Arena: The AI Education Revolution: How AI is Changing the Way We Learn 57:58

1 #241 Patrick M. Pilarski: The Alberta Plan’s Roadmap to AI and AGI 1:01:44

1 #240 Manos Koukoumidis: Why The Future of AI is Open-Source 1:06:03

1 #239 Tuhin Srivatsa: How Baseten is Disrupting AI Deployment & Scaling in 2025 46:17

1 #238 Dominic Williams Reveals His Vision for the Internet Computer (ICP) 1:14:52

1 #237 Pedro Domingos Breaks Down The Symbolist Approach to AI 48:12

1 #236 Pedro Domingo’s on Bayesians and Analogical Learning in AI 56:43

1 #235 Vall Herard: The Future of AI-Driven Compliance (Saifr.ai) 51:55

1 #234 Tyler Xuan Saltsman: How AI is Shaping the Future of Combat & Warfare 38:48

1 #233 Matt Price: How Crescendo is Disrupting Customer Service with Gen AI 44:53

1 #232 Sepp Hochreiter: How LSTMs Power Modern AI System’s 51:08

1 #231 Paras Jain: The Future of AI Video Generation with Genmo 47:49

1 #230 Jamie Lerner: How Quantum Solves AI’s Need for Unstructured Data Solutions 53:23

1 #229 Mitesh Agrawal: Why Lambda Labs’ AI Cloud Is a Game-Changer for Developers 56:07
Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.