Discover a whole new take on Artificial Intelligence with Squirro's educational podcast! Join host Lauren Hawker Zafer, a top voice in Artificial Intelligence on LinkedIn, for insightful chats that unravel the fascinating world of tech innovation, use case exploration and AI knowledge. Dive into candid discussions with accomplished industry experts and established academics. With each episode, you'll expand your grasp of cutting-edge technologies and their incredible impact on society, and y ...
…
continue reading
Content provided by Robin Ranjit Singh Chauhan. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Robin Ranjit Singh Chauhan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!
Go offline with the Player FM app!
Csaba Szepesvari
MP3•Episode home
Manage episode 257924138 series 2536330
Content provided by Robin Ranjit Singh Chauhan. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Robin Ranjit Singh Chauhan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Csaba Szepesvari is:
- Head of the Foundations Team at DeepMind
- Professor of Computer Science at the University of Alberta
- Canada CIFAR AI Chair
- Fellow at the Alberta Machine Intelligence Institute
- Co-Author of the book Bandit Algorithms along with Tor Lattimore, and author of the book Algorithms for Reinforcement Learning
References
- Bandit based monte-carlo planning, Levente Kocsis, Csaba Szepesvári
- Bandit Algorithms, Tor Lattimore, Csaba Szepesvári
- Algorithms for Reinforcement Learning, Csaba Szepesvári
- The Predictron: End-To-End Learning and Planning, David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris
- A Bayesian framework for reinforcement learning, Strens
- Solving Rubik’s Cube with a Robot Hand ; Paper, OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang
- The Nonstochastic Multiarmed Bandit Problem, Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire
- Deep Learning with Bayesian Principles, Mohammad Emtiyaz Khan
- Tackling climate change with Machine Learning David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio
61 episodes
MP3•Episode home
Manage episode 257924138 series 2536330
Content provided by Robin Ranjit Singh Chauhan. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Robin Ranjit Singh Chauhan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Csaba Szepesvari is:
- Head of the Foundations Team at DeepMind
- Professor of Computer Science at the University of Alberta
- Canada CIFAR AI Chair
- Fellow at the Alberta Machine Intelligence Institute
- Co-Author of the book Bandit Algorithms along with Tor Lattimore, and author of the book Algorithms for Reinforcement Learning
References
- Bandit based monte-carlo planning, Levente Kocsis, Csaba Szepesvári
- Bandit Algorithms, Tor Lattimore, Csaba Szepesvári
- Algorithms for Reinforcement Learning, Csaba Szepesvári
- The Predictron: End-To-End Learning and Planning, David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris
- A Bayesian framework for reinforcement learning, Strens
- Solving Rubik’s Cube with a Robot Hand ; Paper, OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang
- The Nonstochastic Multiarmed Bandit Problem, Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire
- Deep Learning with Bayesian Principles, Mohammad Emtiyaz Khan
- Tackling climate change with Machine Learning David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio
61 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.