Content provided by Daniel Reid Cahn. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Daniel Reid Cahn or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined at https://player.fm/legal.

Pre-training LLMs: One Model To Rule Them All? with Talfan Evans, DeepMind

37:36
 

Talfan Evans is a research engineer at DeepMind, where he focuses on data curation and foundational research for pre-training LLMs and multimodal models like Gemini. I ask Talfan:

  • Will one model rule them all?
  • What does "high quality data" actually mean in the context of LLM training?
  • Is language model pre-training becoming commoditized?
  • Are companies like Google and OpenAI keeping their AI secrets to themselves?
  • Do the startup and open-source communities stand a chance against the giants?

Also check out Talfan's latest paper at DeepMind, Bad Students Make Good Teachers.

22 episodes
