Go offline with the Player FM app!
Detoxifying Large Language Models via Knowledge Editing
Manage episode 408513241 series 3524393
The paper explores detoxifying Large Language Models using knowledge editing techniques, introducing SafeEdit benchmark and proposing DINM baseline for efficient detoxification with minimal performance impact.
https://arxiv.org/abs//2403.14472
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2489 episodes
Manage episode 408513241 series 3524393
The paper explores detoxifying Large Language Models using knowledge editing techniques, introducing SafeEdit benchmark and proposing DINM baseline for efficient detoxification with minimal performance impact.
https://arxiv.org/abs//2403.14472
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2489 episodes
Tous les épisodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.