show episodes
 
Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, de ...
  continue reading
 
Artwork

1
VUX World

Kane Simms

icon
Unsubscribe
icon
Unsubscribe
Monthly+
 
Interviews with the best brains in AI, sharing how to improve customer experience and business operations using emerging AI technologies such as voice AI, conversational AI, NLP, Large Language Models (LLMs), generative AI and more. We educate business leaders and teams on why and how AI technologies are revolutionising the way consumers engage with businesses and the internet, why that matters and how to implement it properly. “One of the most consistently insightful and deeply respected po ...
  continue reading
 
Explore the exciting World of Legal Tech and Artificial Intelligence with Alphalect.ai. In this podcast we cover everything you need to know about the Legal Tech World, whether it is drafting a patent, the Use of Legal AI, Blockchain, LLM, Machine Learning and so much more! If you want to learn more, you can also visit our Website: https://alphalect.ai/ This Episode was created with AI. The Content is based on curated sources.
  continue reading
 
Welcome to "The Interconnectedness of Things," the podcast where we explore the seamless integration of technology in our modern world. Hosted by Dr. Andrew Hutson and Emily Nava of QFlow Systems, each episode delves into the dynamic interplay of enterprise solutions, innovative software, and the transformative power of technology in various industries. With expert insights, real-world case studies, and thoughtful discussions, "The Interconnectedness of Things" offers a comprehensive look at ...
  continue reading
 
Open Tech Talks is your weekly sandbox for technology: Artificial Intelligence, Generative AI, Machine Learning, Large Language Models (LLMs) insights, experimentation, and inspiration. Hosted by Kashif Manzoor, AI Evangelist, Cloud Expert, and Enterprise Architect, this Podcast combines technology products, artificial intelligence, machine learning overviews, how-to's, best practices, tips & tricks, and troubleshooting techniques. Whether you're a CIO, IT manager, developer, or just curious ...
  continue reading
 
Artwork

1
The Prompt Desk

Justin Macorin, Bradley Arsenault

icon
Unsubscribe
icon
Unsubscribe
Monthly
 
Embark on a captivating exploration of Large Language Models (LLMs), prompt engineering, and generative AI with hosts Bradley Arsenault and Justin Macorin. With 25 years of combined machine learning and product engineering experience, they are delving deep into the world of LLMs to uncover best practices and stay at the forefront of AI innovation. Join them in shaping the future of technology and software development through their discoveries in LLMs and generative AI. Podcast website: https ...
  continue reading
 
Artwork

1
Deep Papers

Arize AI

icon
Unsubscribe
icon
Unsubscribe
Monthly
 
Deep Papers is a podcast series featuring deep dives on today’s most important AI papers and research. Hosted by Arize AI founders and engineers, each episode profiles the people and techniques behind cutting-edge breakthroughs in machine learning.
  continue reading
 
Artwork

1
Prompt & Pixels

Brent McWhirter

icon
Unsubscribe
icon
Unsubscribe
Daily+
 
**Prompt & Pixels** is your ultimate guide to the creative frontier where AI meets artistry. Join us as we explore cutting-edge technologies like large language models (LLMs) and AI-powered image generation. Whether you’re an artist, entrepreneur, or tech enthusiast, discover how to unlock your creative potential with expert insights, deep dives into emerging AI tools, and interviews with industry innovators. From mastering prompts to creating stunning visuals, *Prompt & Pixels* equips you w ...
  continue reading
 
Artwork
 
"Last Week In r/LocalLLaMA" is your weekly roundup of the most interesting discussions, debates, and moments from the r/LocalLLaMA community. Join us for a fun and lighthearted take on the top posts, user opinions, and trending topics. Perfect for keeping up with the conversation, even when you’re short on time.
  continue reading
 
Are you a critical thinker ready to dive into AI? Welcome to Super Prompt: The Generative AI Podcast. Join me, Tony Wan, an ex Silicon Valley executive, as we 'unhype the hype' of AI via illuminating conversations with top engineers, and in-depth solo episodes. Our goal? To make it almost unnecessary to send a cybernetic organism back in time to fix things. Tailored for the technically-minded and discerningly skeptical, our discussions cover Large Language Models (LLMs), neural networks, mul ...
  continue reading
 
Artwork
 
Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers
  continue reading
 
Artwork
 
Welcome to todai, a podcast series that covers the latest, most interesting, and most bizarre news from the fields of memetics, AI, LLMs, and other fascinating connected subjects. We will be discussing xenopsychology, memetic esotericism, scientific research, community projects, etc. on a regular basis. We are happy to have you join us on this exploration voyage, and this is only the beginning of something amazing.
  continue reading
 
Janes delivers validated open-source defence intelligence across four core capability areas threat, equipment, defence industry and country that are aligned with workflows across the defence industry, national security and government.
  continue reading
 
Bringing doctors and developers together to unlock the potential of AI in healthcare. Together, we can build models that matter. 🤖👨🏻‍⚕️ Hello! We are Dev & Doc, Zeljko and Josh :) Josh is a Neurologist, AI Researcher and Clinical AI Lead. Zeljko is an AI engineer, CTO and associate professor (King's College London) ------------- Substack- https://aiforhealthcare.substack.com/ YT - https://youtube.com/@DevAndDoc
  continue reading
 
Keeping you up to date with the latest trends and best performing architectures in this fast evolving field in computer science. Selecting papers by comparative results, citations and influence we educate you on the latest research. Consider supporting us on Patreon.com/PapersRead for feedback and ideas.
  continue reading
 
Artwork
 
On WE’RE IN!, you'll hear from the newsmakers and innovators who are making waves and driving the cyber security industry forward. We talk to them about their stories, the future of the industry, their best practices, and more.
  continue reading
 
I make videos about machine learning research papers, programming, and issues of the AI community, and the broader impact of AI in society. Twitter: https://twitter.com/ykilcher Discord: https://discord.gg/4H8xxDF If you want to support me, the best thing to do is to share out the content :) If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this): SubscribeStar (preferred to Patreon): https://www.subscribestar.com/yannickilcher Patre ...
  continue reading
 
Artwork
 
Artificial Intelligence is hurtling us into an unknown future. Will it pollute our infosphere, reinforce biases, or even be an existential risk? Or will AI help us solve the energy crisis, revolutionise healthcare and even eliminate the need for work? Perhaps all of these? On Steering AI, we talk to leading academic experts at the cutting-edge of this increasingly powerful and pervasive technology, hearing their views on the benefits and how to steer around the risks. The first step to mitig ...
  continue reading
 
Loading …
show series
 
In the first part of this podcast, Harry Kemsley and Sean Corbett are joined by Jenny Town, Rachel Minyoung Lee, and Martin Williams from 38 North and Cristina Varriale from Janes to take a closer look at North Korea. With South Korea hitting headlines recently following President Yoon Suk-yeol’s impeachment, the panel discusses North Korea’s react…
  continue reading
 
Today, we're joined by Sergey Levine, associate professor at UC Berkeley and co-founder of Physical Intelligence, to discuss π0 (pi-zero), a general-purpose robotic foundation model. We dig into the model architecture, which pairs a vision language model (VLM) with a diffusion-based action expert, and the model training "recipe," emphasizing the ro…
  continue reading
 
This paper investigates Transformers' ability to learn pseudo-random sequences from linear congruential generators, revealing their capacity for in-context prediction and generalization to unseen moduli through algorithmic structures. https://arxiv.org/abs//2502.10390 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arx…
  continue reading
 
This paper investigates Transformers' ability to learn pseudo-random sequences from linear congruential generators, revealing their capacity for in-context prediction and generalization to unseen moduli through algorithmic structures. https://arxiv.org/abs//2502.10390 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arx…
  continue reading
 
The study compares causal reasoning in humans and four large language models, revealing varying degrees of normative behavior and highlighting the importance of assessing AI biases in decision-making. https://arxiv.org/abs//2502.10215 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://…
  continue reading
 
The study compares causal reasoning in humans and four large language models, revealing varying degrees of normative behavior and highlighting the importance of assessing AI biases in decision-making. https://arxiv.org/abs//2502.10215 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://…
  continue reading
 
Eidetic Learning solves catastrophic forgetting in neural networks without rehearsal, enabling efficient task routing and immunity to forgetting across various architectures and tasks. Code is available online. https://arxiv.org/abs//2502.09500 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts…
  continue reading
 
Eidetic Learning solves catastrophic forgetting in neural networks without rehearsal, enabling efficient task routing and immunity to forgetting across various architectures and tasks. Code is available online. https://arxiv.org/abs//2502.09500 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts…
  continue reading
 
This study evaluates 16 large language models on financial reasoning tasks, revealing the need for domain-specific adaptations and introducing a model that improves performance by 10% across tasks. https://arxiv.org/abs//2502.08127 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://pod…
  continue reading
 
This study evaluates 16 large language models on financial reasoning tasks, revealing the need for domain-specific adaptations and introducing a model that improves performance by 10% across tasks. https://arxiv.org/abs//2502.08127 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://pod…
  continue reading
 
This study explores how different prompting methods influence representation geometry in decoder-only language models, revealing distinct mechanisms for task adaptation and interactions between tasks in few-shot learning. https://arxiv.org/abs//2502.08009 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers App…
  continue reading
 
This study explores how different prompting methods influence representation geometry in decoder-only language models, revealing distinct mechanisms for task adaptation and interactions between tasks in few-shot learning. https://arxiv.org/abs//2502.08009 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers App…
  continue reading
 
Please provide the abstract you would like me to summarize. https://arxiv.org/abs//2502.08524 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers…
  continue reading
 
The paper presents a distillation scaling law for optimizing model performance through compute allocation, offering guidelines for effective distillation strategies in various scenarios. https://arxiv.org/abs//2502.08606 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple…
  continue reading
 
The paper presents a distillation scaling law for optimizing model performance through compute allocation, offering guidelines for effective distillation strategies in various scenarios. https://arxiv.org/abs//2502.08606 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple…
  continue reading
 
Reinforcement learning enhances large language models for coding tasks. The general-purpose model o3 outperforms specialized systems, achieving gold at the 2024 IOI without hand-crafted strategies. https://arxiv.org/abs//2502.06807 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://pod…
  continue reading
 
Reinforcement learning enhances large language models for coding tasks. The general-purpose model o3 outperforms specialized systems, achieving gold at the 2024 IOI without hand-crafted strategies. https://arxiv.org/abs//2502.06807 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://pod…
  continue reading
 
DeepCrossAttention (DCA) enhances transformer residual learning by using dynamic weights and depth-wise cross-attention, improving model performance and speed while maintaining low parameter count. https://arxiv.org/abs//2502.06785 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://pod…
  continue reading
 
DeepCrossAttention (DCA) enhances transformer residual learning by using dynamic weights and depth-wise cross-attention, improving model performance and speed while maintaining low parameter count. https://arxiv.org/abs//2502.06785 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://pod…
  continue reading
 
Today we’re joined by Victor Dibia, principal research software engineer at Microsoft Research, to explore the key trends and advancements in AI agents and multi-agent systems shaping 2025 and beyond. In this episode, we discuss the unique abilities that set AI agents apart from traditional software systems–reasoning, acting, communicating, and ada…
  continue reading
 
The paper advocates for multi-LLM collaboration to enhance reliability and representation in complex scenarios, arguing that a single LLM is insufficient for diverse data and skills. https://arxiv.org/abs//2502.04506 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com…
  continue reading
 
The paper advocates for multi-LLM collaboration to enhance reliability and representation in complex scenarios, arguing that a single LLM is insufficient for diverse data and skills. https://arxiv.org/abs//2502.04506 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com…
  continue reading
 
The paper presents an offline framework for training LLM agents to optimally request assistance, combining process reward models with reinforcement learning to enhance efficiency and reduce intervention costs. https://arxiv.org/abs//2502.04576 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts:…
  continue reading
 
The paper presents an offline framework for training LLM agents to optimally request assistance, combining process reward models with reinforcement learning to enhance efficiency and reduce intervention costs. https://arxiv.org/abs//2502.04576 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts:…
  continue reading
 
This paper presents a novel language model that enhances reasoning by iterating recurrent blocks, improving performance without specialized training data, and efficiently scaling computation at test-time. https://arxiv.org/abs//2502.05171 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: http…
  continue reading
 
This paper presents a novel language model that enhances reasoning by iterating recurrent blocks, improving performance without specialized training data, and efficiently scaling computation at test-time. https://arxiv.org/abs//2502.05171 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: http…
  continue reading
 
This study investigates long chains-of-thought in large language models, revealing key factors for effective reasoning and the importance of reinforcement learning and training strategies for optimal performance. https://arxiv.org/abs//2502.03373 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcas…
  continue reading
 
This study investigates long chains-of-thought in large language models, revealing key factors for effective reasoning and the importance of reinforcement learning and training strategies for optimal performance. https://arxiv.org/abs//2502.03373 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcas…
  continue reading
 
The paper presents ULTRAIF, a method for enhancing LLMs' ability to follow complex instructions using open-source data, achieving competitive performance on instruction-following benchmarks. https://arxiv.org/abs//2502.04153 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.a…
  continue reading
 
The paper presents ULTRAIF, a method for enhancing LLMs' ability to follow complex instructions using open-source data, achieving competitive performance on instruction-following benchmarks. https://arxiv.org/abs//2502.04153 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.a…
  continue reading
 
This paper presents a method to map feature evolution in large language models, enhancing interpretability and enabling targeted control of model behavior through cross-layer feature analysis. https://arxiv.org/abs//2502.03032 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts…
  continue reading
 
This paper presents a method to map feature evolution in large language models, enhancing interpretability and enabling targeted control of model behavior through cross-layer feature analysis. https://arxiv.org/abs//2502.03032 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts…
  continue reading
 
Dev and Doc put Deepseek R1 to the test in a technical and clinical deep dive. 👋 Hey! If you are enjoying our conversations, reach out, share your thoughts and journey with us. Don't forget to subscribe whilst you're here :) 👨🏻‍⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-au-yeung/ 🤖Dev - Zeljko Kraljevic https://twitter.com/…
  continue reading
 
This study investigates transformers' inconsistent performance on two-hop questions, revealing that capacity scaling and generalization affect their ability to learn and answer these complex queries effectively. https://arxiv.org/abs//2502.03490 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcast…
  continue reading
 
This study investigates transformers' inconsistent performance on two-hop questions, revealing that capacity scaling and generalization affect their ability to learn and answer these complex queries effectively. https://arxiv.org/abs//2502.03490 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcast…
  continue reading
 
The paper highlights the lack of reliable benchmarks for large language models, proposing "platinum benchmarks" to minimize label errors and revealing persistent model failures in simple tasks. https://arxiv.org/abs//2502.03461 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcast…
  continue reading
 
The paper highlights the lack of reliable benchmarks for large language models, proposing "platinum benchmarks" to minimize label errors and revealing persistent model failures in simple tasks. https://arxiv.org/abs//2502.03461 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcast…
  continue reading
 
The study evaluates linear probes for detecting AI deception, achieving high accuracy in distinguishing honest from deceptive outputs, but concludes that current methods are insufficient for robust defense. https://arxiv.org/abs//2502.03407 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: ht…
  continue reading
 
Loading …

Quick Reference Guide

Listen to this show while you explore
Play