Content provided by Emily Laird. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Emily Laird or her podcast platform partner. If you believe someone is using your copyrighted work without permission, you can follow the process outlined at https://player.fm/legal.

Transformers Mini Series: How do Transformers work?

8:12

In part two of our Transformer mini-series, we peel back the layers to uncover the mechanics that make Transformers the rock stars of the AI world. Think of this episode as your backstage pass to understanding how these models operate. We’ll break down the self-attention mechanism, comparing it to having superhuman hearing at a party, and explore the power of multi-head attention, likened to having multiple sets of ears tuned to different conversations. We also delve into the rigorous training process of Transformers, from the use of GPUs and TPUs to optimization strategies.
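The episode explains these ideas by analogy; for listeners who want to see the math behind the "superhuman hearing," here is a minimal NumPy sketch of scaled dot-product self-attention and multi-head attention in the standard Transformer formulation (Vaswani et al., 2017). This is our illustration, not code from the episode, and all shapes, weight matrices, and function names are assumptions chosen for the example.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max for numerical stability before exponentiating.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    """Single-head self-attention over a sequence X of shape (seq_len, d_model)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv       # project tokens to queries, keys, values
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)        # how much each token "listens" to every other
    weights = softmax(scores, axis=-1)     # attention weights sum to 1 per query token
    return weights @ V                     # weighted mix of value vectors

def multi_head_attention(X, heads, Wo):
    """Several independent heads ("sets of ears"), concatenated and mixed by Wo."""
    outs = [self_attention(X, Wq, Wk, Wv) for (Wq, Wk, Wv) in heads]
    return np.concatenate(outs, axis=-1) @ Wo

# Toy usage: 4 tokens, d_model = 8, two heads of size 4 each.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
heads = [tuple(rng.normal(size=(8, 4)) for _ in range(3)) for _ in range(2)]
Wo = rng.normal(size=(8, 8))
print(multi_head_attention(X, heads, Wo).shape)  # (4, 8)
```

Each head attends to the sequence independently, like one set of ears tuned to one conversation, and the concatenated head outputs are projected back to the model dimension.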
Connect with Emily Laird on LinkedIn

22 episodes
