Go offline with the Player FM app!
Parallelism and Acceleration for Large Language Models with Bryan Catanzaro - #507
Manage episode 299512547 series 2355587
Today we’re joined by Bryan Catanzaro, vice president of applied deep learning research at NVIDIA.
Most folks know Bryan as one of the founders/creators of cuDNN, the accelerated library for deep neural networks. In our conversation, we explore his interest in high-performance computing and its recent overlap with AI, his current work on Megatron, a framework for training giant language models, and the basic approach for distributing a large language model on DGX infrastructure.
We also discuss the three different kinds of parallelism, tensor parallelism, pipeline parallelism, and data parallelism, that Megatron provides when training models, as well as his work on the Deep Learning Super Sampling project and the role it's playing in the present and future of game development via ray tracing.
The complete show notes for this episode can be found at twimlai.com/go/507.
720 episodes
Parallelism and Acceleration for Large Language Models with Bryan Catanzaro - #507
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Manage episode 299512547 series 2355587
Today we’re joined by Bryan Catanzaro, vice president of applied deep learning research at NVIDIA.
Most folks know Bryan as one of the founders/creators of cuDNN, the accelerated library for deep neural networks. In our conversation, we explore his interest in high-performance computing and its recent overlap with AI, his current work on Megatron, a framework for training giant language models, and the basic approach for distributing a large language model on DGX infrastructure.
We also discuss the three different kinds of parallelism, tensor parallelism, pipeline parallelism, and data parallelism, that Megatron provides when training models, as well as his work on the Deep Learning Super Sampling project and the role it's playing in the present and future of game development via ray tracing.
The complete show notes for this episode can be found at twimlai.com/go/507.
720 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.