Artwork

Content provided by Building The Future Radio & TV Show and Kevin Horek - Building The Future Show - Radio / TV / Podcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Building The Future Radio & TV Show and Kevin Horek - Building The Future Show - Radio / TV / Podcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

Ep. 567 w/ Brian Stevens CEO at Neural Magic

46:33
 
Share
 

Manage episode 414235059 series 2396473
Content provided by Building The Future Radio & TV Show and Kevin Horek - Building The Future Show - Radio / TV / Podcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Building The Future Radio & TV Show and Kevin Horek - Building The Future Show - Radio / TV / Podcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

Together with our community, we engineer sparse LLM, CV, and NLP models that are more efficient and performant in production. Why does this matter? Sparse models are more flexible and can achieve unrivaled latency and throughput performance on your private CPU and GPU infrastructure. Check us out on GitHub and join the Neural Magic Slack Community to get started with software-delivered AI.

http://neuralmagic.com/

  continue reading

595 episodes

Artwork
iconShare
 
Manage episode 414235059 series 2396473
Content provided by Building The Future Radio & TV Show and Kevin Horek - Building The Future Show - Radio / TV / Podcast. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Building The Future Radio & TV Show and Kevin Horek - Building The Future Show - Radio / TV / Podcast or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

Together with our community, we engineer sparse LLM, CV, and NLP models that are more efficient and performant in production. Why does this matter? Sparse models are more flexible and can achieve unrivaled latency and throughput performance on your private CPU and GPU infrastructure. Check us out on GitHub and join the Neural Magic Slack Community to get started with software-delivered AI.

http://neuralmagic.com/

  continue reading

595 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide