AI lab podcast, "decrypting" expert analysis to understand Artificial Intelligence from a policy making point of view.
…
continue reading
A new podcast about the world of generative AI, including ChatGPT, Large Language Models (LLMs), DALL-E, Stable Diffusion, and more.
…
continue reading
1
Designing Futures: Exploring AI, Data, Architecture and beyond.
Nathalie Rozencwajg & Melanie Rozencwajg
Join us as we embark on a captivating journey through the ever-evolving intersection of AI, data, and architecture. In this podcast, we dive deep into the vast potential of AI for architecture and design, examining the remarkable possibilities it offers, while also acknowledging the challenges it presents. Our mission is to expand the conversation, engaging with leaders, thinkers, and doers in the ecosystem. We invite them to share their profound insights, groundbreaking ideas, and innovativ ...
…
continue reading
Welcome to Stable DiNEWSion, breaking down the latest stories in the world of Stable Diffusion and Generative AI. We fend off the fuss with facts. Hosted by Bill Meeks, writer, AI artist, and founder of Everly Heights Productions.
…
continue reading
Eye on A.I. is a biweekly podcast, hosted by longtime New York Times correspondent Craig S. Smith. In each episode, Craig will talk to people making a difference in artificial intelligence. The podcast aims to put incremental advances into a broader context and consider the global implications of the developing technology. AI is about to change your world, so pay attention.
…
continue reading
Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers Support this podcast: https://podcasters.spotify.com/pod/s ...
…
continue reading
Artificial Antics is a podcast about Artificial Intelligence that caters to the skeptic and uninitiated. Join this unlikely trio Mike (the techy), Rico (the skeptic) and A.I. as they dive headfirst into the world of artificial intelligence. From debating the social implications and ethical concerns around AI to figuring out how to break into the lucrative AI market, no topic is off-limits. And with A.I. on board, you never know what kind of shenanigans are in store. Will A.I. turn out to be ...
…
continue reading
Hey there! Welcome to our cozy corner of the internet, where it’s all about merging tech with texture, pixels with palettes, and algorithms with aesthetics. I’m Jenna Gaidusek, your guide and fellow explorer in this fascinating world where interior design meets cutting-edge AI. Think of this as your go-to spot for demystifying the tech that’s set to revolutionize our industry, from the newest AI-driven design apps to the smartest tech out there. Visit www.aiforinteriordesigners.com for more ...
…
continue reading
Knowledge Distillation is the podcast that brings together a mixture of experts from across the Artificial Intelligence community. We talk to the world’s leading researchers about their experiences developing cutting-edge models as well as the technologists taking AI tools out of the lab and turning them into commercial products and services. Knowledge Distillation also takes a critical look at the impact of artificial intelligence on society – opting for expert analysis instead of hysterica ...
…
continue reading
Learn to write effective prompts for ChatGPT, Bard, Midjourney, DALLE, and other AI systems. Also hosting bi-weekly prompt engineering masterminds, where you bring a prompt and we all colaborate to improve it. Each episode we explore prompting techniques, interviews with experts and newbies, and tips on selling your prompts. Released weekly! Let me know who you'd like me to interview at PromptEngineeringPodcast.com Keep in touch: - https://www.linkedin.com/groups/14231334/ - https://t.me/Pro ...
…
continue reading
"What if we Could?" A podcast exploring this question that drive us. We explore the practical application of artificial intelligence, product design, blockchain & AR/VR, and tech alpha in the service of humans.
…
continue reading
Become a Paid Subscriber: https://podcasters.spotify.com/pod/show/rebeltech/subscribe Welcome to my Rebel Rant Series podcast! Join me as I dive into topics that matter, and share my unfiltered thoughts and opinions. This podcast is a different side of me, separate from my YouTube videos that I upload. It's raw, it's real, and it's here to inspire and motivate. In this podcast, I'll be sharing never-before-seen footage and insights into my life, as well as discussing topics ranging from busi ...
…
continue reading
"The AI Tools Report from Rachel Perry" is a cutting-edge podcast dedicated to exploring the dynamic and rapidly evolving world of artificial intelligence tools. Each episode, hosted by Rachel Perry, delves into the latest developments, breakthroughs, and ethical considerations in AI. The show offers in-depth analyses, expert interviews, and engaging discussions, making complex AI concepts accessible to a broad audience. Whether you're an AI enthusiast, a professional in the field, or just c ...
…
continue reading
1
#186 Ronen Dar: Maximizing GPU Utilization for AI with Run:ai
1:11:01
1:11:01
Play later
Play later
Lists
Like
Liked
1:11:01
This episode is sponsored by 1Password. 1Password combines industry-leading security with award-winning design to bring private, secure, and user-friendly password management to everyone. Companies lose hours every day just from employees forgetting and resetting passwords. A single data breach costs millions of dollars. 1Password secures every sig…
…
continue reading
1
[QA] From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control
11:33
11:33
Play later
Play later
Lists
Like
Liked
11:33
Hierarchical control in robotics faces challenges with language interfaces. Learnable Latent Codes as Bridges (LCB) offer a solution, outperforming language-based baselines on complex tasks in embodied agent benchmarks. https://arxiv.org/abs//2405.04798 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple…
…
continue reading
1
From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control
13:23
13:23
Play later
Play later
Lists
Like
Liked
13:23
Hierarchical control in robotics faces challenges with language interfaces. Learnable Latent Codes as Bridges (LCB) offer a solution, outperforming language-based baselines on complex tasks in embodied agent benchmarks. https://arxiv.org/abs//2405.04798 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple…
…
continue reading
1
[QA] Distilling Diffusion Models into Conditional GANs
8:28
8:28
Play later
Play later
Lists
Like
Liked
8:28
Proposing a method to distill a complex diffusion model into a single-step GAN, accelerating inference while maintaining image quality, outperforming existing models on COCO benchmark. https://arxiv.org/abs//2405.05967 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.c…
…
continue reading
1
Distilling Diffusion Models into Conditional GANs
17:14
17:14
Play later
Play later
Lists
Like
Liked
17:14
Proposing a method to distill a complex diffusion model into a single-step GAN, accelerating inference while maintaining image quality, outperforming existing models on COCO benchmark. https://arxiv.org/abs//2405.05967 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.c…
…
continue reading
1
[QA] AlphaMath Almost Zero: process Supervision without process
10:57
10:57
Play later
Play later
Lists
Like
Liked
10:57
Innovative approach uses Monte Carlo Tree Search to automatically generate supervision signals for training large language models, improving mathematical reasoning proficiency without manual annotation. https://arxiv.org/abs//2405.03553 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https:…
…
continue reading
1
AlphaMath Almost Zero: process Supervision without process
12:31
12:31
Play later
Play later
Lists
Like
Liked
12:31
Innovative approach uses Monte Carlo Tree Search to automatically generate supervision signals for training large language models, improving mathematical reasoning proficiency without manual annotation. https://arxiv.org/abs//2405.03553 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https:…
…
continue reading
1
[QA] Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models
10:34
10:34
Play later
Play later
Lists
Like
Liked
10:34
The paper presents the creation and performance of the arctic-embed text embedding models, showcasing state-of-the-art retrieval accuracy and providing insights into their training process. https://arxiv.org/abs//2405.05374 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.ap…
…
continue reading
1
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models
13:52
13:52
Play later
Play later
Lists
Like
Liked
13:52
The paper presents the creation and performance of the arctic-embed text embedding models, showcasing state-of-the-art retrieval accuracy and providing insights into their training process. https://arxiv.org/abs//2405.05374 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.ap…
…
continue reading
1
[QA] Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
9:53
9:53
Play later
Play later
Lists
Like
Liked
9:53
Large Language Models (LLMs) can deceive as 'alignment fakers.' A benchmark with 324 LLM pairs is introduced to detect misbehaving models, achieving 98% accuracy with a specific strategy. https://arxiv.org/abs//2405.05466 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.appl…
…
continue reading
1
Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
8:56
8:56
Play later
Play later
Lists
Like
Liked
8:56
Large Language Models (LLMs) can deceive as 'alignment fakers.' A benchmark with 324 LLM pairs is introduced to detect misbehaving models, achieving 98% accuracy with a specific strategy. https://arxiv.org/abs//2405.05466 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.appl…
…
continue reading
1
[QA] Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
7:16
7:16
Play later
Play later
Lists
Like
Liked
7:16
Supervised fine-tuning of large language models introduces new factual knowledge, impacting model behavior. New knowledge is learned slower, leading to increased tendency to hallucinate factually incorrect responses. https://arxiv.org/abs//2405.05904 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Po…
…
continue reading
1
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
17:29
17:29
Play later
Play later
Lists
Like
Liked
17:29
Supervised fine-tuning of large language models introduces new factual knowledge, impacting model behavior. New knowledge is learned slower, leading to increased tendency to hallucinate factually incorrect responses. https://arxiv.org/abs//2405.05904 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Po…
…
continue reading
1
[QA] Towards a Theoretical Understanding of the `Reversal Curse' via Training Dynamics
10:44
10:44
Play later
Play later
Lists
Like
Liked
10:44
The paper analyzes the "reversal curse" in large language models, explaining why they struggle with logical reasoning tasks like inverse search and chain-of-thought. https://arxiv.org/abs//2405.04669 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv…
…
continue reading
1
Towards a Theoretical Understanding of the `Reversal Curse' via Training Dynamics
24:33
24:33
Play later
Play later
Lists
Like
Liked
24:33
The paper analyzes the "reversal curse" in large language models, explaining why they struggle with logical reasoning tasks like inverse search and chain-of-thought. https://arxiv.org/abs//2405.04669 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv…
…
continue reading
1
[QA] Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models
10:00
10:00
Play later
Play later
Lists
Like
Liked
10:00
AT-EDM framework uses attention maps for efficient token pruning in Diffusion Models, achieving significant FLOPs savings and speed-up without retraining, maintaining image quality. https://arxiv.org/abs//2405.05252 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/…
…
continue reading
1
Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models
13:32
13:32
Play later
Play later
Lists
Like
Liked
13:32
AT-EDM framework uses attention maps for efficient token pruning in Diffusion Models, achieving significant FLOPs savings and speed-up without retraining, maintaining image quality. https://arxiv.org/abs//2405.05252 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/…
…
continue reading
1
[QA] Custom Gradient Estimators are Straight-Through Estimators in Disguise
8:09
8:09
Play later
Play later
Lists
Like
Liked
8:09
The paper addresses challenges in quantization-aware training by proposing differentiable approximations for quantization functions, showing equivalence of weight gradient estimators, and experimental validation on various models. https://arxiv.org/abs//2405.05171 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_p…
…
continue reading
1
Custom Gradient Estimators are Straight-Through Estimators in Disguise
16:36
16:36
Play later
Play later
Lists
Like
Liked
16:36
The paper addresses challenges in quantization-aware training by proposing differentiable approximations for quantization functions, showing equivalence of weight gradient estimators, and experimental validation on various models. https://arxiv.org/abs//2405.05171 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_p…
…
continue reading
1
EP 10: Social Media for Interior Designers Using Chat GPT with Lezlie Swink
21:46
21:46
Play later
Play later
Lists
Like
Liked
21:46
In this conversation, Jenna Gaidusek interviews Leslie Swink of Swink Social about utilizing AI in interior design. They discuss the importance of defining the ideal client and how AI can assist in this process. They also talk about creating personalized content, including value-added, personal brand, and promotional content. Leslie shares how AI c…
…
continue reading
1
[QA] The Curse of Diversity in Ensemble-Based Exploration
9:19
9:19
Play later
Play later
Lists
Like
Liked
9:19
Ensemble training in deep reinforcement learning can harm individual agents due to data sharing. The curse of diversity is explained and mitigated with Cross-Ensemble Representation Learning. https://arxiv.org/abs//2405.04342 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.…
…
continue reading
1
The Curse of Diversity in Ensemble-Based Exploration
10:27
10:27
Play later
Play later
Lists
Like
Liked
10:27
Ensemble training in deep reinforcement learning can harm individual agents due to data sharing. The curse of diversity is explained and mitigated with Cross-Ensemble Representation Learning. https://arxiv.org/abs//2405.04342 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.…
…
continue reading
1
AI lab - AI in Action | Episode 01: AI History
9:15
9:15
Play later
Play later
Lists
Like
Liked
9:15
We are kickstarting our AI in Action series by diving headfirst into the key milestones that led to the gradual deployment of Artificial Intelligence, or AI for short. You might think it's some shiny new invention, looking at all the recent media coverage about robots taking over your jobs and writing bad poetry. But hold on to your Roomba, because…
…
continue reading
1
[QA] ImageInWords: Unlocking Hyper-Detailed Image Descriptions
10:56
10:56
Play later
Play later
Lists
Like
Liked
10:56
Image descriptions for training Vision-Language models are often inaccurate. ImageInWords introduces a new dataset with hyper-detailed descriptions, improving model performance significantly. https://arxiv.org/abs//2405.02793 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.…
…
continue reading
1
ImageInWords: Unlocking Hyper-Detailed Image Descriptions
15:09
15:09
Play later
Play later
Lists
Like
Liked
15:09
Image descriptions for training Vision-Language models are often inaccurate. ImageInWords introduces a new dataset with hyper-detailed descriptions, improving model performance significantly. https://arxiv.org/abs//2405.02793 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.…
…
continue reading
Sharpness-Aware Minimization (SAM) excels in label noise robustness, with peak performance under early stopping, attributed to changes in logit term and network Jacobian. Alternative methods mimic SAM's regularization effects effectively. https://arxiv.org/abs//2405.03676 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/…
…
continue reading
Sharpness-Aware Minimization (SAM) excels in label noise robustness, with peak performance under early stopping, attributed to changes in logit term and network Jacobian. Alternative methods mimic SAM's regularization effects effectively. https://arxiv.org/abs//2405.03676 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/…
…
continue reading
The paper addresses challenges in training large-scale machine learning models, focusing on numeric deviation causing instability, with a case study on Flash Attention optimization. https://arxiv.org/abs//2405.02803 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/…
…
continue reading
The paper addresses challenges in training large-scale machine learning models, focusing on numeric deviation causing instability, with a case study on Flash Attention optimization. https://arxiv.org/abs//2405.02803 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/…
…
continue reading
1
#185 Damon Rasheed & Saurabh Jain: Achieving Up to 90% Accuracy in Predicting Drug Clinical Trial Outcomes with Opyl's Trialkey.ai
51:55
51:55
Play later
Play later
Lists
Like
Liked
51:55
This episode is sponsored by Netsuite by Oracle, the number one cloud financial system, streamlining accounting, financial management, inventory, HR, and more. NetSuite is offering a one-of-a-kind flexible financing program. Head to https://netsuite.com/EYEONAI to know more. Unlock the secrets of clinical trial predictions with Saurabh Jain and Dam…
…
continue reading
1
The Future of AI Accountability & Responsibility
51:01
51:01
Play later
Play later
Lists
Like
Liked
51:01
We are thrilled to bring you an insightful episode that dives deep into the pressing issues of accountability and responsibility in the realm of Artificial Intelligence within the creative industries. Introduction by Dyann Heward-Mills: The EU AI Act To set the framework, we have the honor of welcoming Dyann Heward-Mills:, an esteemed expert in AI …
…
continue reading
1
[QA] Understanding LLMs Requires More Than Statistical Generalization
10:10
10:10
Play later
Play later
Lists
Like
Liked
10:10
The paper discusses the non-identifiability of large language models (LLMs) and its implications on generalization, highlighting the need for a new theoretical perspective. https://arxiv.org/abs//2405.01964 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcas…
…
continue reading
1
Understanding LLMs Requires More Than Statistical Generalization
19:25
19:25
Play later
Play later
Lists
Like
Liked
19:25
The paper discusses the non-identifiability of large language models (LLMs) and its implications on generalization, highlighting the need for a new theoretical perspective. https://arxiv.org/abs//2405.01964 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcas…
…
continue reading
1
[QA] Mitigating LLM Hallucinations via Conformal Abstention
8:28
8:28
Play later
Play later
Lists
Like
Liked
8:28
Developing a method for large language models to abstain from providing incorrect answers, using self-consistency and conformal prediction to reduce hallucination rates. https://arxiv.org/abs//2405.01563 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/a…
…
continue reading
1
Mitigating LLM Hallucinations via Conformal Abstention
19:11
19:11
Play later
Play later
Lists
Like
Liked
19:11
Developing a method for large language models to abstain from providing incorrect answers, using self-consistency and conformal prediction to reduce hallucination rates. https://arxiv.org/abs//2405.01563 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/a…
…
continue reading
1
[QA] Structural Pruning of Pre-trained Language Models via Neural Architecture Search
8:56
8:56
Play later
Play later
Lists
Like
Liked
8:56
Paper explores using neural architecture search (NAS) for structural pruning of pre-trained language models to optimize efficiency and generalization performance, utilizing two-stage weight-sharing NAS for accelerated search. https://arxiv.org/abs//2405.02267 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers…
…
continue reading
1
Structural Pruning of Pre-trained Language Models via Neural Architecture Search
13:25
13:25
Play later
Play later
Lists
Like
Liked
13:25
Paper explores using neural architecture search (NAS) for structural pruning of pre-trained language models to optimize efficiency and generalization performance, utilizing two-stage weight-sharing NAS for accelerated search. https://arxiv.org/abs//2405.02267 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers…
…
continue reading
1
[QA] Capabilities of Gemini Models in Medicine
13:07
13:07
Play later
Play later
Lists
Like
Liked
13:07
The paper explores the impact of climate change on global food security, highlighting the need for sustainable agricultural practices to mitigate future risks. https://arxiv.org/abs//2404.18416 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-paper…
…
continue reading
1
Capabilities of Gemini Models in Medicine
35:57
35:57
Play later
Play later
Lists
Like
Liked
35:57
The paper explores the impact of climate change on global food security, highlighting the need for sustainable agricultural practices to mitigate future risks. https://arxiv.org/abs//2404.18416 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-paper…
…
continue reading
1
[QA] StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
9:55
9:55
Play later
Play later
Lists
Like
Liked
9:55
The paper introduces Consistent Self-Attention and Semantic Motion Predictor to enhance content consistency in diffusion-based generative models for text-to-image and video generation, enabling rich visual story creation. https://arxiv.org/abs//2405.01434 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers App…
…
continue reading
1
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
15:05
15:05
Play later
Play later
Lists
Like
Liked
15:05
The paper introduces Consistent Self-Attention and Semantic Motion Predictor to enhance content consistency in diffusion-based generative models for text-to-image and video generation, enabling rich visual story creation. https://arxiv.org/abs//2405.01434 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers App…
…
continue reading
1
[QA] In-Context Learning with Long-Context Models: An In-Depth Exploration
9:43
9:43
Play later
Play later
Lists
Like
Liked
9:43
The paper explores in-context learning (ICL) at extreme scales, showing performance improvements with hundreds or thousands of demonstrations, contrasting with example retrieval and finetuning. https://arxiv.org/abs//2405.00200 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcast…
…
continue reading
1
In-Context Learning with Long-Context Models: An In-Depth Exploration
13:01
13:01
Play later
Play later
Lists
Like
Liked
13:01
The paper explores in-context learning (ICL) at extreme scales, showing performance improvements with hundreds or thousands of demonstrations, contrasting with example retrieval and finetuning. https://arxiv.org/abs//2405.00200 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcast…
…
continue reading
1
[QA] WILDCHAT: 1M ChatGPT Interaction Logs in the Wild
11:01
11:01
Play later
Play later
Lists
Like
Liked
11:01
WILDCHAT is a diverse dataset of 1 million user-ChatGPT conversations, offering rich insights into chatbot interactions and potential toxic use-cases for researchers. https://arxiv.org/abs//2405.01470 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxi…
…
continue reading
1
WILDCHAT: 1M ChatGPT Interaction Logs in the Wild
13:11
13:11
Play later
Play later
Lists
Like
Liked
13:11
WILDCHAT is a diverse dataset of 1 million user-ChatGPT conversations, offering rich insights into chatbot interactions and potential toxic use-cases for researchers. https://arxiv.org/abs//2405.01470 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxi…
…
continue reading
1
[QA] NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
10:22
10:22
Play later
Play later
Lists
Like
Liked
10:22
NeMo-Aligner is a scalable toolkit for aligning Large Language Models with human values, supporting various alignment paradigms and designed for extensibility. https://arxiv.org/abs//2405.01481 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-paper…
…
continue reading
1
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
11:39
11:39
Play later
Play later
Lists
Like
Liked
11:39
NeMo-Aligner is a scalable toolkit for aligning Large Language Models with human values, supporting various alignment paradigms and designed for extensibility. https://arxiv.org/abs//2405.01481 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-paper…
…
continue reading
1
[QA] PROMETHEUS 2: An Open Source Language Model Specialized in Evaluating Other Language Models
7:57
7:57
Play later
Play later
Lists
Like
Liked
7:57
Prometheus 2 is an open-source LM designed for evaluating responses, outperforming existing models in correlation with human and proprietary LM judgments. https://arxiv.org/abs//2405.01535 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1…
…
continue reading
1
PROMETHEUS 2: An Open Source Language Model Specialized in Evaluating Other Language Models
14:09
14:09
Play later
Play later
Lists
Like
Liked
14:09
Prometheus 2 is an open-source LM designed for evaluating responses, outperforming existing models in correlation with human and proprietary LM judgments. https://arxiv.org/abs//2405.01535 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1…
…
continue reading
1
[QA] A Careful Examination of Large Language Model Performance on Grade School Arithmetic
7:39
7:39
Play later
Play later
Lists
Like
Liked
7:39
Study investigates dataset contamination in large language models for mathematical reasoning using Grade School Math 1000 benchmark, finding evidence of overfitting and potential memorization of benchmark questions. https://arxiv.org/abs//2405.00332 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Pod…
…
continue reading