Artwork

Content provided by Ben Jaffe and Katie Malone, Ben Jaffe, and Katie Malone. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Ben Jaffe and Katie Malone, Ben Jaffe, and Katie Malone or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

The Lottery Ticket Hypothesis

19:45
 
Share
 

Manage episode 254315967 series 74115
Content provided by Ben Jaffe and Katie Malone, Ben Jaffe, and Katie Malone. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Ben Jaffe and Katie Malone, Ben Jaffe, and Katie Malone or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Recent research into neural networks reveals that sometimes, not all parts of the neural net are equally responsible for the performance of the network overall. Instead, it seems like (in some neural nets, at least) there are smaller subnetworks present where most of the predictive power resides. The fascinating thing is that, for some of these subnetworks (so-called “winning lottery tickets”), it’s not the training process that makes them good at their classification or regression tasks: they just happened to be initialized in a way that was very effective. This changes the way we think about what training might be doing, in a pretty fundamental way. Sometimes, instead of crafting a good fit from wholecloth, training might be finding the parts of the network that always had predictive power to begin with, and isolating and strengthening them. This research is pretty recent, having only come to prominence in the last year, but nonetheless challenges our notions about what it means to train a machine learning model.
  continue reading

293 episodes

Artwork

The Lottery Ticket Hypothesis

Linear Digressions

3,117 subscribers

published

iconShare
 
Manage episode 254315967 series 74115
Content provided by Ben Jaffe and Katie Malone, Ben Jaffe, and Katie Malone. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Ben Jaffe and Katie Malone, Ben Jaffe, and Katie Malone or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Recent research into neural networks reveals that sometimes, not all parts of the neural net are equally responsible for the performance of the network overall. Instead, it seems like (in some neural nets, at least) there are smaller subnetworks present where most of the predictive power resides. The fascinating thing is that, for some of these subnetworks (so-called “winning lottery tickets”), it’s not the training process that makes them good at their classification or regression tasks: they just happened to be initialized in a way that was very effective. This changes the way we think about what training might be doing, in a pretty fundamental way. Sometimes, instead of crafting a good fit from wholecloth, training might be finding the parts of the network that always had predictive power to begin with, and isolating and strengthening them. This research is pretty recent, having only come to prominence in the last year, but nonetheless challenges our notions about what it means to train a machine learning model.
  continue reading

293 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide