Activated LoRA: Fine-tuned LLMs for Intrinsics
Activated LoRA (aLoRA) extends LoRA by adapting weights only for the relevant tokens, so an adapter can be activated instantly without recomputing the KV cache, which improves efficiency in multi-turn settings.
https://arxiv.org/abs/2504.12397
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
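The idea in the episode summary can be illustrated with a small NumPy sketch. It is not the paper's implementation: it assumes that the "relevant tokens" are those at or after an activation position, and the names `alora_project`, `start`, `A`, and `B` are hypothetical. The sketch shows that when the low-rank update is applied only from the activation position onward, the key projections for the earlier tokens match the base model's, so cached entries for that prefix could be reused.

```python
import numpy as np

def alora_project(x, W, A, B, start):
    """Project token states x (T, d_in) with base weight W (d_in, d_out).

    The low-rank update A @ B is added only for positions >= start
    (assumption: these are the tokens the adapter is meant to affect).
    """
    base = x @ W                       # base-model projection for all tokens
    delta = (x @ A) @ B                # low-rank LoRA-style update
    mask = (np.arange(x.shape[0]) >= start)[:, None]
    return base + mask * delta         # update is zeroed before `start`

rng = np.random.default_rng(0)
T, d, r, start = 6, 8, 2, 4            # toy sizes, chosen for illustration
x = rng.normal(size=(T, d))            # hidden states for T tokens
W_k = rng.normal(size=(d, d))          # base key projection
A = rng.normal(size=(d, r))            # hypothetical adapter factors
B = rng.normal(size=(r, d))

base_keys = x @ W_k
adapted_keys = alora_project(x, W_k, A, B, start)

# Keys before the activation point equal the base model's keys,
# while keys from the activation point onward are adapted.
prefix_reusable = np.allclose(adapted_keys[:start], base_keys[:start])
suffix_changed = not np.allclose(adapted_keys[start:], base_keys[start:])
```

By contrast, a standard LoRA adapter in this toy setup would change the projection for every position, so keys cached by the base model for the prefix would no longer match.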
2313 episodes
All episodes
[QA] Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening 7:45
Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening 16:56
[QA] Why Gradients Rapidly Increase Near the End of Training 7:00
Why Gradients Rapidly Increase Near the End of Training 11:24
[QA] GEM: Empowering LLM for both Embedding Generation and Language Understanding 7:41
GEM: Empowering LLM for both Embedding Generation and Language Understanding 20:38
[QA] HYPERSTEER: Activation Steering at Scale with Hypernetworks 7:49
HYPERSTEER: Activation Steering at Scale with Hypernetworks 9:15
[QA] Accelerating Diffusion LLMs via Adaptive Parallel Decoding 8:08
[QA] Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones 7:26
Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones 24:00
[QA] Maximizing Confidence Alone Improves Reasoning 7:08
Maximizing Confidence Alone Improves Reasoning 13:21
[QA] Hardware-Efficient Attention for Fast Decoding 7:57
Hardware-Efficient Attention for Fast Decoding 30:59
[QA] Reinforcing General Reasoning without Verifiers 7:08
Reinforcing General Reasoning without Verifiers 17:11
[QA] ENIGMATA: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles 8:16
ENIGMATA: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles 23:54
[QA] Temporal Sampling for Forgotten Reasoning in LLMs 7:04
Temporal Sampling for Forgotten Reasoning in LLMs 10:43
[QA] Are Large Language Models Reliable AI Scientists? Assessing Reverse-Engineering of Black-Box Systems 10:15
Are Large Language Models Reliable AI Scientists? Assessing Reverse-Engineering of Black-Box Systems 17:21
Accelerating Diffusion LLMs via Adaptive Parallel Decoding 21:09
[QA] Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning 7:34
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning 16:44
[QA] Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning 8:08
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning 23:02
[QA] ALPHAONE: Reasoning Models Thinking Slow and Fast at Test Time 7:21
ALPHAONE: Reasoning Models Thinking Slow and Fast at Test Time 17:12
[QA] ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models 7:40
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models 23:32
[QA] Are Reasoning Models More Prone to Hallucination? 7:52
Are Reasoning Models More Prone to Hallucination? 20:24
[QA] How does Transformer Learn Implicit Reasoning? 8:56
How does Transformer Learn Implicit Reasoning? 23:21