A daily update on the latest AI research papers. We provide a high-level overview of a handful of papers each day and link all papers in the description for further reading. This podcast is created entirely with AI by PocketPod. Head over to https://pocketpod.app to learn more.
Are you a critical thinker ready to dive into AI? Welcome to The Generative AI Podcast: Super Prompt. Join me, Tony Wan, an ex Silicon Valley executive, as we 'unhype the hype' of AI via illuminating conversations with top engineers and entrepreneurs, complemented by in-depth solo episodes. Our goal? To make it almost unnecessary to send a cybernetic organism back in time to fix things. Tailored for the technically-minded and discerningly skeptical, our discussions cover Large Language Model ...
Keeping you up to date with the latest trends and best-performing architectures in this fast-evolving field of computer science. We select papers by comparative results, citations, and influence to educate you on the latest research. Consider supporting us on Patreon.com/PapersRead for feedback and ideas.
Tune in as we dissect recent AI news, explore cutting-edge innovations, and sit down with influential voices shaping the future of AI. Whether you're a seasoned expert or just dipping your toes into the AI waters, our podcast is your go-to resource for staying informed and inspired. #IntelAI @IntelAI
Your AI Roadmap the podcast is on a mission to decrease fluffy HYPE and talk to the people actually building AI. Anyone can build in AI. Including you. Whether you’re terrified or excited, there’s been no better time than today to dive in! Now is the time to be curious and future-proof your career and ... ultimately your income. This podcast isn't about white dudes patting themselves on the back, this is about you and me and ALL the paths into cool projects around the world! What's next on y ...
Welcome to an exciting new season of the podcast Your Career: Choice or Chance? - as we dive into the ever-evolving world of GenAI in the Workplace and explore the latest trends, experiences, and career journeys shaping the future of work as AI is increasingly ingrained in it. Each episode provides fresh insights, addressing the transformative influence of GenAI in shaping the workforce of tomorrow, making it a must-listen for anyone interested in staying ahead in the ever-evolving world of ...
AI Named This Show is a weekly AI-focused tech show. Join longtime friends and tech media veterans Tasia Custode and Tristan Jutras as they dive into the AI abyss, unraveling the complexities of artificial intelligence. They cover everything from groundbreaking AI news to the practical applications — and societal implications — of large language models, machine learning, deep learning, generative AI and more. FOLLOW AI Named This Show on Facebook, Instagram, YouTube and X (Twitter) Tristan & ...
Weekly talks and fireside chats about everything that has to do with the new space emerging around DevOps for Machine Learning aka MLOps aka Machine Learning Operations.
"On AI" is an innovative podcast uniquely tailored for creators diving into the fascinating world of generative AI. Generative AI is accelerating artistic expressions resulting in the development of completely new genres, transforming art, design, film, music, storytelling and immersive multimodal experiences. It's also transforming the way we experience the world. From fashion, gaming, robotics and all the pop culture in between. In each episode, we examine the transformative power of artif ...
Looking to explore the intersection of AI and journalism? Influential thought leaders in the industry join data scientist and media entrepreneur, Nikita Roy, each week to explore what's next with AI and its implications for the media landscape. In each episode, industry experts discuss how automated newsrooms have the potential to change journalism and uncover opportunities to optimize workflows and increase efficiency without compromising journalistic integrity. Hosted on Acast. See acast.c ...
EdgeCortix subject matter experts discuss edge AI processors, AI software frameworks, and AI industry trends.
A weekly show discussing the latest developments in the European technology ecosystem and featuring interviews with some of the most interesting people in the industry.
Your monthly dose of AR/VR
I make videos about machine learning research papers, programming, and issues of the AI community, and the broader impact of AI in society. Twitter: https://twitter.com/ykilcher Discord: https://discord.gg/4H8xxDF If you want to support me, the best thing to do is to share out the content :) If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this): SubscribeStar (preferred to Patreon): https://www.subscribestar.com/yannickilcher Patre ...
The AR Show dives deep into the emerging world of Augmented Reality with a focus on the underlying technologies and uses of Smartglasses, and the people behind them. I talk with entrepreneurs, executives, investors and early adopters to extract insights that will both inform and inspire you. In each episode, I explore the approaches, challenges, and progress behind the products and companies. I also extract the lessons learned and insightful advice from each guest. Equal parts technology, pr ...
Seed to Harvest, hosted by Paige Finn Doherty, highlights stories, frameworks & tactics from a diverse array of investors, founders, and creators. If you're interested in investing or building a business, this show is for you!
eDiscovery Data Points are selected articles published on the ComplexDiscovery blog and shared to update legal, information technology, and business professionals on the art and science of data discovery and legal discovery.
Vocabulary Expansion for Large Models, Big Data Enhancing LMs, 4D Reconstruction Progress, AI Cityscape Generation, DPO Policy Analysis, Expanding Code Models, Multimodal LM Trust Evaluation
14:55
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies; Scaling Retrieval-Based Language Models with a Trillion-Token Datastore; Shape of Motion: 4D Reconstruction from a Single Video; Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion; Understanding Reference Policies in Direct Preference Opti…
SEED-Story: Multimodal Long Story Generation with Large Language Model
22:27
With the remarkable advancements in image generation and open-form text generation, the creation of interleaved image-text content has become an increasingly intriguing field. Multimodal story generation, characterized by producing narrative texts and vivid images in an interleaved manner, has emerged as a valuable and practical task with broad app…
From satellite tracking and space regulation to multimodal AI, and more
38:54
Welcome to the new episode of the TNW Podcast, the show where we discuss the latest developments in the European technology ecosystem and feature interviews with some of the most interesting people in the industry. In today’s episode, we're featuring two interviews recorded at TNW Conference 2024 with two amazing women working in very differen…
IMAGDressing-v1: Customizable Virtual Dressing
27:37
Latest advances have achieved realistic virtual try-on (VTON) through localized garment inpainting using latent diffusion models, significantly enhancing consumers' online shopping experience. However, existing VTON technologies neglect the need for merchants to showcase garments comprehensively, including flexible control over garments, optional f…
The next frontier of generative AI 🔴 AINTS 046
52:56
This week, Tristan and Tasia look at new image-generating tools from Samsung and Microsoft and check out Amazon's new AI shopping assistant, Rufus. Then we encounter OpenAI's Strawberry (FKA Q*) reasoning technology and new types of AI models that might someday supplant transformers. Join us as we navigate the dangers of the frontier. Can you survi…
Eric Landry is a seasoned AI and Machine Learning leader with extensive expertise in software engineering and practical applications in NLP, document classification, and conversational AI. With technical proficiency in Java, Python, and key ML tools, he leads the Expedia Machine Learning Engineering Guild and has spoken at major conferences like Ap…
A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
36:34
Human video generation is a dynamic and rapidly evolving task that aims to synthesize 2D human body video sequences with generative models given control conditions such as text, audio, and pose. With the potential for wide-ranging applications in film, gaming, and virtual communication, the ability to generate natural and realistic human video is c…
Making it Easier for Businesses to Deploy AI Today with Comprehensive AI Solutions Featuring Seamus Jones
27:06
Whether you're an AI enthusiast or a business looking to integrate AI into your operations, this episode offers valuable insights into the rapidly evolving AI landscape. Join Seamus Jones, Director of Technical Marketing/Engineering at Dell, as he explains how Dell is creating comprehensive AI solutions. These solutions encompass everything from AI…
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
49:58
The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents. However, existing multi-agent frameworks often struggle with integrating diverse capable third-party agents due to reliance on agents defined within their own ecosystems. They also face challenges in simulating distribute…
Ariane 6 brings hope; how European companies use AI
1:12:07
Welcome to the new episode of the TNW Podcast — the show where we discuss the latest developments in the European technology ecosystem and feature interviews with some of the most interesting people in the industry. In today’s episode, Linnea and Andrii talk about the launch of Ariane 6 and its consequences, the woes of Firefly, robotic laundry fol…
Qwen2 Language Model, Mitigating Privacy Risks in LLMs, Exploring Non-Determinism, Increased Efficiency with Q-Sparse, GRUtopia for Embodied AI
10:38
Qwen2 Technical Report; Learning to Refuse: Towards Mitigating Privacy Risks in LLMs; The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism; Q-Sparse: All Large Language Models can be Fully Sparsely-Activated; GRUtopia: Dream General Robots in a City at Scale
Evaluating the Effectiveness of Large Language Models: Challenges and Insights // Aniket Singh // #248
35:40
Aniket Kumar Singh is a Vision Systems Engineer at Ultium Cells, skilled in Machine Learning and Deep Learning, and is also engaged in AI research focusing on Large Language Models (LLMs). Evaluating the Effectiveness of Large Language Models: Challenges and Insights // MLOps Podcast #248 with Aniket Kumar Singh, CTO @ MyEvaluationPal | ML Engineer @ …
AI in Law: Revolutionizing the Legal Field with Peter Geovanes
45:47
Peter Geovanes, Chief Innovation and AI Officer at McGuireWoods LLP, discusses AI's transformative impact on the legal sector. He emphasizes AI tools like ChatGPT-4 for enhancing efficiency, competitive advantage, and prompt engineering. Highlighting data privacy and ethical use, he advocates a "trust but verify" approach. Geovanes also stresses th…
Skywork-Math's Reasoning, Video Diffusion Model Innovations, Multimodal Learning, Q-GaLore's Memory Efficiency, MAVIS: Visual Math Instruction
12:11
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On; Video Diffusion Alignment via Reward Gradients; Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model; Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients; MAVIS: Math…
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models
39:20
While language models (LMs) have shown potential across a range of decision-making tasks, their reliance on simple acting processes limits their broad deployment as autonomous agents. In this paper, we introduce Language Agent Tree Search (LATS) -- the first general framework that synergizes the capabilities of LMs in reasoning, acting, and plannin…
This week, Tristan and Tasia explore various recent AI announcements, collaborations and whoopsie-doopsies involving Microsoft, OpenAI, Apple, Samsung, Google, and Motorola. We also discuss Google's recent backtracking on its net zero emissions targets in light of increasing resource demands from AI. Plus: an AI beauty pageant and a surprise appear…
Extending AI: From Industry to Innovation // Sophia Rowland & David Weik // #246
1:01:36
Sophia Rowland is a Senior Product Manager focusing on ModelOps and MLOps at SAS. In her previous role as a data scientist, Sophia worked with dozens of organizations to solve a variety of problems using analytics. David Weik has a passion for data and creating integrated customer-centric solutions. Thinking data and people first to create value-ad…
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
39:35
Portrait Animation aims to synthesize a lifelike video from a single source image, using it as an appearance reference, with motion (i.e., facial expressions and head pose) derived from a driving video, audio, text, or generation. Instead of following mainstream diffusion-based methods, we explore and extend the potential of the implicit-keypoint-b…
Agentless: Demystifying LLM-based Software Engineering Agents
35:54
Recent advancements in large language models (LLMs) have significantly advanced the automation of software development tasks, including code synthesis, program repair, and test generation. More recently, researchers and industry practitioners have developed various autonomous LLM agents to perform end-to-end software development tasks. These agents…
Daniel Keiper-Knorr on startups and funding; fusion power in Europe delayed again
35:04
Welcome to the new episode of the TNW Podcast — the show where we discuss the latest developments in the European technology ecosystem and feature interviews with some of the most interesting people in the industry. In today’s episode, Andrii talks about fusion energy in Europe, 'the war on floppy disks', and more. The guest of the show is Daniel K…
Beyond Encoders in Vision-Language Models, Revolutionizing Human-LLM Interaction, and Advancing Knowledge Graphs
12:05
Unveiling Encoder-Free Vision-Language Models; FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs; AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents; RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models; ChartGemma: Visual Instruction-…
Detecting Harmful Content at Scale // Matar Haller // #245
51:27
Matar Haller is the VP of Data & AI at ActiveFence, where her teams own the end-to-end automated detection of harmful content at scale, regardless of the abuse area or media type. The work they do here is engaging, impactful, and tough, and Matar is grateful for the people she gets to do it with. AI For Good - Detecting Harmful Content at Scale // …
Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?
36:47
Long-context language models (LCLMs) have the potential to revolutionize our approach to tasks traditionally reliant on external tools like retrieval systems or databases. Leveraging LCLMs' ability to natively ingest and process entire corpora of information offers numerous advantages. It enhances user-friendliness by eliminating the need for speci…
Inside Upwork: Lauren Dycus on AI and Market Differentiation
47:35
Lauren Dycus, Director of Product Management at Upwork, details her role and the company's mission. She focuses on work management, which streamlines processes from user onboarding to contract finalization. Upwork, the world's largest work marketplace, supports freelancers with specialized skills, enabling personal and company growth. Dycus emphasi…
Garance Burke: A Journalist's Guide to Investigating Artificial Intelligence
45:26
Garance Burke, a global investigative journalist at the Associated Press, joins host Nikita Roy to discuss the crucial role of journalism in holding AI systems accountable and the challenges reporters face in covering this complex topic. Burke, a global investigative journalist with The Associated Press, has been at the forefront of investigating t…
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code
27:24
Despite Large Language Models (LLMs) like GPT-4 achieving impressive results in function-level code generation, they struggle with repository-scale code understanding (e.g., coming up with the right arguments for calling routines), requiring a deeper comprehension of complex file interactions. Also, recently, people have developed LLM agents that a…
Diffusion Forcing to Expert Tuning, Structured Planning, Vision-Language Models, and Tabular ML Benchmarks
11:34
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion; Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models; Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages; InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Co…
Advancing AI's Mathematical Reasoning: WE-MATH, ROS-LLM Framework, Autoregressive Image Generation
10:36
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?; ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning; MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation; LiteSearch: Efficacious Tree Search for LLM; Wavelets Are All You Need for Autoregressive Image…
AI storytelling: As you wish 🔴 AINTS 044
1:07:16
This week, Tristan and Tasia are joined by AINTS' very first guest, writer/performer/podcaster/video producer Bill Meeks, to look at the latest AI image and video generator updates from Stable Diffusion and Runway AI, along with a couple of newcomers. Then Bill walks us through the key features of Everly Heights Story Studio, his AI storytelling …
All Data Scientists Should Learn Software Engineering Principles // Catherine Nelson // #245
52:54
Catherine Nelson is a freelance data scientist and writer. She is currently working on the forthcoming O’Reilly book "Software Engineering for Data Scientists". Why All Data Scientists Should Learn Software Engineering Principles // MLOps podcast #245 with Catherine Nelson, a freelance Data Scientist. A big thank you to LatticeFlow AI for sponsoring …
Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
22:25
In this work, we introduce Unique3D, a novel image-to-3D framework for efficiently generating high-quality 3D meshes from single-view images, featuring state-of-the-art generation fidelity and strong generalizability. Previous methods based on Score Distillation Sampling (SDS) can produce diversified 3D results by distilling 3D knowledge from large…
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
37:18
We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Co…
The Importance of Flexibility and Governance in AI Model Management, with Robert Daigle
24:49
Learn about the importance of flexibility and governance in AI model management as Robert Daigle, Director of Global AI Business at Lenovo, discusses the future of AI deployment across various computing environments. He highlights the collaborative efforts of Lenovo and partners in addressing specific vertical use cases such as retail, healthcare, …
AWE 2024 Panel Discussion: Current State and Future Direction of AR Glasses
56:00
I moderated a panel at the recent AWE conference that took place a couple of weeks ago in Long Beach, California. The panel featured Karl Guttag from KGOnTech, Adi Robertson from The Verge, Jeri Ellsworth from Tilt Five, and Ed Tang from Avegant. The session was titled "Current State and Future Direction of AR Glasses" and the session description re…
Meta GenAI Infra Blog Review // Special MLOps Podcast
38:53
Meta GenAI Infra Blog Review // Special MLOps Podcast episode by Demetrios. // Abstract: Demetrios explores Meta's innovative infrastructure for large-scale AI operations, highlighting three blog posts on training large language models, maintaining AI capacity, and building Meta's GenAI infrastructure. The discussion reveals Meta's handling of hundred…
Persona-Driven Data Synthesis, Enhancing Medical MLLMs, Robot Learning, Knowledge Distillation in LLMs, Text to 3D Gaussian Revolution
11:24
Scaling Synthetic Data Creation with 1,000,000,000 Personas; HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale; LLaRA: Supercharging Robot Learning Data for Vision-Language Policy; Direct Preference Knowledge Distillation for Large Language Models; GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enh…
Generative AI for Enterprise with Noelle Russell of AI Leadership Institute
45:42
Noelle Russell, founder of the AI Leadership Institute, shares her experience in AI, highlighting responsible AI, challenges in implementation, and the potential of generative AI in customer service. She emphasizes collaboration, flexibility, and aligning values with partners. Russell discusses harnessing team knowledge with LLMs, the role of certi…
OMG-LLaVA: Unifying Vision and Language Understanding, Step-DPO for LLMs Mathematical Reasoning, MUMU's Multimodal Image Generation
12:15
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding; Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs; MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data; Simulating Classroom Education with LLM-Empowered Agents; SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval …
This week, Tristan and Tasia relive a tech journalist's chaotic adventure with Meta's Ray-Ban AI glasses in Montreal and uncover the significant security flaw found in Rabbit’s R1 AI gadget. Then we dive into the heated legal battles over AI-generated music, with the RIAA suing Suno and Udio for copyright infringement and YouTube negotiating music …
AI Agents for Consumers // Shaun Wei // #244
57:26
Shaun Wei, the CEO and co-founder of RealChar, shares his journey from working in the autonomous vehicle industry to creating an open-source voice assistant project called RealChar, which eventually evolved into Rivia, a voice AI assistant focused on managing personal phone calls. The Future of AI and Consumer Empowerment // MLOps podcast #244 with S…
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
38:01
The conventional recipe for maximizing model accuracy is to (1) train multiple models with various hyperparameters and (2) pick the individual model which performs best on a held-out validation set, discarding the remainder. In this paper, we revisit the second step of this procedure in the context of fine-tuning large pre-trained models, where fin…
FineWeb Datasets, YouDream's 3D Animals, PDE-Solving Breakthrough, Noise-Conditioned Perception Alignment, Language Models' Continual Learning
11:02
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale; YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals; DiffusionPDE: Generative PDE-Solving Under Partial Observation; Aligning Diffusion Models with Noise-Conditioned Perception; Unlocking Continual Learning Abilities in Language Models…
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture
1:06:40
There are two common ways in which developers are incorporating proprietary and domain-specific data when building applications of Large Language Models (LLMs): Retrieval-Augmented Generation (RAG) and fine-tuning. RAG augments the prompt with the external data, while fine-tuning incorporates the additional knowledge into the model itself. However,…
BigCodeBench Challenges, Cambrian-1 Leap, D-MERIT's Evaluation, Long Context Breakthrough in Vision
11:06
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation; BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions; Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs; Evaluating D-MERIT of Partial-annotation on Information Retrieval; Long Context Transfer from Language to Vision…
Live from TNW 2024! Maria Amelie on fact-checking with AI; post-quantum cryptography; competitive Excel
37:58
In today’s episode, recorded with a live audience at TNW Conference 2024, Linnea and Andrii talk about post-quantum cryptography, competitive Excel, and a few more things in between. The guest of the show is Maria Amelie, CEO and founder of Factiverse. The company has just raised €1mn in funding to further build its platform that helps researchers,…
Seven Failure Points When Engineering a Retrieval Augmented Generation System
21:27
Software engineers are increasingly adding semantic search capabilities to applications using a strategy known as Retrieval Augmented Generation (RAG). A RAG system involves finding documents that semantically match a query and then passing the documents to a large language model (LLM) such as ChatGPT to extract the right answer. RAG s…
LongRAG Breakthrough, LLMs as Judges, Transformer Memory Insights, Video Library AI, Democratizing Art Styles
10:14
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs; Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges; Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task; Towards Retrieval Augmented Generation over Large Video Libraries; Stylebreeder: Exploring …
ML and AI as Distinct Control Systems in Heavy Industrial Settings // Richard Howes // #243
56:30
Join us at our first in-person conference today all about AI Quality: https://www.aiqualityconference.com/ ML and AI as Distinct Control Systems in Heavy Industrial Settings // MLOps podcast #243 with Richard Howes, CTO of Metaformed. Richard Howes is a dedicated engineer who is passionate about control systems whether it be embedded systems, indust…