Artwork

Content provided by Thomas Frey. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Thomas Frey or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

Ep. 143: Evolution, values, and AI Safety | Quintin Pope

1:09:21
 
Share
 

Manage episode 379331431 series 2824460
Content provided by Thomas Frey. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Thomas Frey or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

Quintin Pope is a computer science graduate student at Oregon State University, and an alignment researcher focusing on methods of instilling human-compatible values into deep learning-based AI systems, with a particular focus on language models. He co-developed shard theory, an attempt to explain the human value formation process as a consequence of simple reinforcement learning and self-supervised learning dynamics. His interests also include the optimization dynamics of neural networks, human brains, and evolution, as well as how they tie into AI takeoff scenarios and alignment concerns. His current research focuses on methods of scalably supervising self-improving AI systems.

Learn more about your ad choices. Visit megaphone.fm/adchoices

  continue reading

163 episodes

Artwork
iconShare
 
Manage episode 379331431 series 2824460
Content provided by Thomas Frey. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Thomas Frey or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

Quintin Pope is a computer science graduate student at Oregon State University, and an alignment researcher focusing on methods of instilling human-compatible values into deep learning-based AI systems, with a particular focus on language models. He co-developed shard theory, an attempt to explain the human value formation process as a consequence of simple reinforcement learning and self-supervised learning dynamics. His interests also include the optimization dynamics of neural networks, human brains, and evolution, as well as how they tie into AI takeoff scenarios and alignment concerns. His current research focuses on methods of scalably supervising self-improving AI systems.

Learn more about your ad choices. Visit megaphone.fm/adchoices

  continue reading

163 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide