[QA] Towards a Theoretical Understanding of the `Reversal Curse' via Training Dynamics
The paper analyzes the "reversal curse" in large language models, explaining why they struggle with logical reasoning tasks like inverse search and chain-of-thought.
https://arxiv.org/abs/2405.04669
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1119 episodes