 
A daily update on the latest AI research papers. We provide a high-level overview of a handful of papers each day and link all papers in the description for further reading. This podcast is created entirely with AI by PocketPod. Head over to https://pocketpod.app to learn more.
 
Are you a critical thinker ready to dive into AI? Welcome to The Generative AI Podcast: Super Prompt. Join me, Tony Wan, an ex Silicon Valley executive, as we 'unhype the hype' of AI via illuminating conversations with top engineers and entrepreneurs, complemented by in-depth solo episodes. Our goal? To make it almost unnecessary to send a cybernetic organism back in time to fix things. Tailored for the technically-minded and discerningly skeptical, our discussions cover Large Language Model ...
 
Keeping you up to date with the latest trends and best-performing architectures in this fast-evolving field of computer science. Selecting papers by comparative results, citations, and influence, we educate you on the latest research. Consider supporting us on Patreon.com/PapersRead for feedback and ideas.
 
Intel on AI

Intel Corporation

Monthly
 
Tune in as we dissect recent AI news, explore cutting-edge innovations, and sit down with influential voices shaping the future of AI. Whether you're a seasoned expert or just dipping your toes into the AI waters, our podcast is your go-to resource for staying informed and inspired. #IntelAI @IntelAI
 
Your AI Roadmap

Dr. Joan Palmiter Bajorek

Weekly
 
Your AI Roadmap the podcast is on a mission to decrease fluffy HYPE and talk to the people actually building AI. Anyone can build in AI. Including you. Whether you’re terrified or excited, there’s been no better time than today to dive in! Now is the time to be curious and future-proof your career and ... ultimately your income. This podcast isn't about white dudes patting themselves on the back, this is about you and me and ALL the paths into cool projects around the world! What's next on y ...
 
Welcome to an exciting new season of the podcast Your Career: Choice or Chance? - as we dive into the ever-evolving world of GenAI in the Workplace and explore the latest trends, experiences, and career journeys shaping the future of work as AI is increasingly ingrained in it. Each episode provides fresh insights, addressing the transformative influence of GenAI in shaping the workforce of tomorrow, making it a must-listen for anyone interested in staying ahead in the ever-evolving world of ...
 
AI Named This Show

Tasia Custode, Tristan Jutras

Weekly
 
AI Named This Show is a weekly AI-focused tech show. Join longtime friends and tech media veterans Tasia Custode and Tristan Jutras as they dive into the AI abyss, unraveling the complexities of artificial intelligence. They cover everything from groundbreaking AI news to the practical applications — and societal implications — of large language models, machine learning, deep learning, generative AI and more. FOLLOW AI Named This Show on Facebook, Instagram, YouTube and X (Twitter) Tristan & ...
 
"On AI" is an innovative podcast uniquely tailored for creators diving into the fascinating world of generative AI. Generative AI is accelerating artistic expressions resulting in the development of completely new genres, transforming art, design, film, music, storytelling and immersive multimodal experiences. It's also transforming the way we experience the world. From fashion, gaming, robotics and all the pop culture in between. In each episode, we examine the transformative power of artif ...
 
Looking to explore the intersection of AI and journalism? Influential thought leaders in the industry join data scientist and media entrepreneur, Nikita Roy, each week to explore what's next with AI and its implications for the media landscape. In each episode, industry experts discuss how automated newsrooms have the potential to change journalism and uncover opportunities to optimize workflows and increase efficiency without compromising journalistic integrity. Hosted on Acast. See acast.c ...
 
I make videos about machine learning research papers, programming, and issues of the AI community, and the broader impact of AI in society. Twitter: https://twitter.com/ykilcher Discord: https://discord.gg/4H8xxDF If you want to support me, the best thing to do is to share out the content :) If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this): SubscribeStar (preferred to Patreon): https://www.subscribestar.com/yannickilcher Patre ...
 
AR Show with Jason McDowall

Jason McDowall: Investor | Advocate | Entrepreneur

Monthly
 
The AR Show dives deep into the emerging world of Augmented Reality with a focus on the underlying technologies and uses of Smartglasses, and the people behind them. I talk with entrepreneurs, executives, investors and early adopters to extract insights that will both inform and inspire you. In each episode, I explore the approaches, challenges, and progress behind the products and companies. I also extract the lessons learned and insightful advice from each guest. Equal parts technology, pr ...
 
Seed to Harvest

Paige Finn Doherty

Monthly+
 
Seed to Harvest, hosted by Paige Finn Doherty, highlights stories, frameworks & tactics from a diverse array of investors, founders, and creators. If you're interested in investing or building a business, this show is for you!
 
 
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Shape of Motion: 4D Reconstruction from a Single Video
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
Understanding Reference Policies in Direct Preference Opti…
 
With the remarkable advancements in image generation and open-form text generation, the creation of interleaved image-text content has become an increasingly intriguing field. Multimodal story generation, characterized by producing narrative texts and vivid images in an interleaved manner, has emerged as a valuable and practical task with broad app…
 
Welcome to the new episode of the TNW Podcast, the show where we discuss the latest developments in the European technology ecosystem and feature interviews with some of the most interesting people in the industry. In today’s episode, we're featuring two interviews recorded at TNW Conference 2024 with two amazing women working in very differen…
 
Latest advances have achieved realistic virtual try-on (VTON) through localized garment inpainting using latent diffusion models, significantly enhancing consumers' online shopping experience. However, existing VTON technologies neglect the need for merchants to showcase garments comprehensively, including flexible control over garments, optional f…
 
This week, Tristan and Tasia look at new image-generating tools from Samsung and Microsoft and check out Amazon's new AI shopping assistant, Rufus. Then we encounter OpenAI's Strawberry (FKA Q*) reasoning technology and new types of AI models that might someday supplant transformers. Join us as we navigate the dangers of the frontier. Can you survi…
 
Eric Landry is a seasoned AI and Machine Learning leader with extensive expertise in software engineering and practical applications in NLP, document classification, and conversational AI. With technical proficiency in Java, Python, and key ML tools, he leads the Expedia Machine Learning Engineering Guild and has spoken at major conferences like Ap…
 
Human video generation is a dynamic and rapidly evolving task that aims to synthesize 2D human body video sequences with generative models given control conditions such as text, audio, and pose. With the potential for wide-ranging applications in film, gaming, and virtual communication, the ability to generate natural and realistic human video is c…
 
Whether you're an AI enthusiast or a business looking to integrate AI into your operations, this episode offers valuable insights into the rapidly evolving AI landscape. Join Seamus Jones, Director of Technical Marketing/Engineering at Dell, as he explains how Dell is creating comprehensive AI solutions. These solutions encompass everything from AI…
 
The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents. However, existing multi-agent frameworks often struggle with integrating diverse capable third-party agents due to reliance on agents defined within their own ecosystems. They also face challenges in simulating distribute…
 
Welcome to the new episode of the TNW Podcast — the show where we discuss the latest developments in the European technology ecosystem and feature interviews with some of the most interesting people in the industry. In today’s episode, Linnea and Andrii talk about the launch of Ariane 6 and its consequences, the woes of Firefly, robotic laundry fol…
 
Qwen2 Technical Report
Learning to Refuse: Towards Mitigating Privacy Risks in LLMs
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
GRUtopia: Dream General Robots in a City at Scale
 
Aniket Kumar Singh is a Vision Systems Engineer at Ultium Cells, skilled in Machine Learning and Deep Learning. He is also engaged in AI research, focusing on Large Language Models (LLMs). Evaluating the Effectiveness of Large Language Models: Challenges and Insights // MLOps Podcast #248 with Aniket Kumar Singh, CTO @ MyEvaluationPal | ML Engineer @ …
 
Peter Geovanes, Chief Innovation and AI Officer at McGuireWoods LLP, discusses AI's transformative impact on the legal sector. He emphasizes AI tools like ChatGPT-4 for enhancing efficiency, competitive advantage, and prompt engineering. Highlighting data privacy and ethical use, he advocates a "trust but verify" approach. Geovanes also stresses th…
 
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
Video Diffusion Alignment via Reward Gradients
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients
MAVIS: Math…
 
While language models (LMs) have shown potential across a range of decision-making tasks, their reliance on simple acting processes limits their broad deployment as autonomous agents. In this paper, we introduce Language Agent Tree Search (LATS) -- the first general framework that synergizes the capabilities of LMs in reasoning, acting, and plannin…
 
This week, Tristan and Tasia explore various recent AI announcements, collaborations and whoopsie-doopsies involving Microsoft, OpenAI, Apple, Samsung, Google, and Motorola. We also discuss Google's recent backtracking on its net zero emissions targets in light of increasing resource demands from AI. Plus: an AI beauty pageant and a surprise appear…
 
Sophia Rowland is a Senior Product Manager focusing on ModelOps and MLOps at SAS. In her previous role as a data scientist, Sophia worked with dozens of organizations to solve a variety of problems using analytics. David Weik has a passion for data and creating integrated customer-centric solutions. Thinking data and people first to create value-ad…
 
Portrait Animation aims to synthesize a lifelike video from a single source image, using it as an appearance reference, with motion (i.e., facial expressions and head pose) derived from a driving video, audio, text, or generation. Instead of following mainstream diffusion-based methods, we explore and extend the potential of the implicit-keypoint-b…
 
Recent advancements in large language models (LLMs) have significantly advanced the automation of software development tasks, including code synthesis, program repair, and test generation. More recently, researchers and industry practitioners have developed various autonomous LLM agents to perform end-to-end software development tasks. These agents…
 
Welcome to the new episode of the TNW Podcast — the show where we discuss the latest developments in the European technology ecosystem and feature interviews with some of the most interesting people in the industry. In today’s episode, Andrii talks about fusion energy in Europe, 'the war on floppy disks', and more. The guest of the show is Daniel K…
 
Unveiling Encoder-Free Vision-Language Models
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
ChartGemma: Visual Instruction-…
 
Matar Haller is the VP of Data & AI at ActiveFence, where her teams own the end-to-end automated detection of harmful content at scale, regardless of the abuse area or media type. The work they do here is engaging, impactful, and tough, and Matar is grateful for the people she gets to do it with. AI For Good - Detecting Harmful Content at Scale // …
 
Long-context language models (LCLMs) have the potential to revolutionize our approach to tasks traditionally reliant on external tools like retrieval systems or databases. Leveraging LCLMs' ability to natively ingest and process entire corpora of information offers numerous advantages. It enhances user-friendliness by eliminating the need for speci…
 
Lauren Dycus, Director of Product Management at Upwork, details her role and the company's mission. She focuses on work management, which streamlines processes from user onboarding to contract finalization. Upwork, the world's largest work marketplace, supports freelancers with specialized skills, enabling personal and company growth. Dycus emphasi…
 
Garance Burke, a global investigative journalist at the Associated Press, joins host Nikita Roy to discuss the crucial role of journalism in holding AI systems accountable and the challenges reporters face in covering this complex topic. Burke, a global investigative journalist with The Associated Press, has been at the forefront of investigating t…
 
Despite Large Language Models (LLMs) like GPT-4 achieving impressive results in function-level code generation, they struggle with repository-scale code understanding (e.g., coming up with the right arguments for calling routines), requiring a deeper comprehension of complex file interactions. Also, recently, people have developed LLM agents that a…
 
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Co…
 
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning
MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
LiteSearch: Efficacious Tree Search for LLM
Wavelets Are All You Need for Autoregressive Image…
 
This week, Tristan and Tasia are joined by AINTS' very first guest, writer/performer/podcaster/video producer Bill Meeks, to look at the latest AI image and video generator updates from Stable Diffusion and Runway AI, along with a couple of newcomers. Then Bill walks us through the key features of Everly Heights Story Studio, his AI storytelling …
 
Catherine Nelson is a freelance data scientist and writer. She is currently working on the forthcoming O’Reilly book "Software Engineering for Data Scientists". Why All Data Scientists Should Learn Software Engineering Principles // MLOps podcast #245 with Catherine Nelson, a freelance Data Scientist. A big thank you to LatticeFlow AI for sponsoring …
 
In this work, we introduce Unique3D, a novel image-to-3D framework for efficiently generating high-quality 3D meshes from single-view images, featuring state-of-the-art generation fidelity and strong generalizability. Previous methods based on Score Distillation Sampling (SDS) can produce diversified 3D results by distilling 3D knowledge from large…
 
We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Co…
 
Learn about the importance of flexibility and governance in AI model management as Robert Daigle, Director of Global AI Business at Lenovo, discusses the future of AI deployment across various computing environments. He highlights the collaborative efforts of Lenovo and partners in addressing specific vertical use cases such as retail, healthcare, …
 
I moderated a panel at the recent AWE conference that took place a couple of weeks ago in Long Beach, California. The panel featured Karl Guttag from KGOnTech, Adi Robertson from The Verge, Jeri Ellsworth from Tilt Five, and Ed Tang from Avegant. The session was titled: Current State and Future Direction of AR Glasses and the session description re…
 
Meta GenAI Infra Blog Review // Special MLOps Podcast episode by Demetrios. // Abstract: Demetrios explores Meta's innovative infrastructure for large-scale AI operations, highlighting three blog posts on training large language models, maintaining AI capacity, and building Meta's GenAI infrastructure. The discussion reveals Meta's handling of hundred…
 
Scaling Synthetic Data Creation with 1,000,000,000 Personas
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Direct Preference Knowledge Distillation for Large Language Models
GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enh…
 
Noelle Russell, founder of the AI Leadership Institute, shares her experience in AI, highlighting responsible AI, challenges in implementation, and the potential of generative AI in customer service. She emphasizes collaboration, flexibility, and aligning values with partners. Russell discusses harnessing team knowledge with LLMs, the role of certi…
 
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data
Simulating Classroom Education with LLM-Empowered Agents
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval …
 
This week, Tristan and Tasia relive a tech journalist's chaotic adventure with Meta's Ray-Ban AI glasses in Montreal and uncover the significant security flaw found in Rabbit’s R1 AI gadget. Then we dive into the heated legal battles over AI-generated music, with the RIAA suing Suno and Udio for copyright infringement and YouTube negotiating music …
 
Sean Wei, the CEO and co-founder of RealChar, shares his journey from working in the autonomous vehicle industry to creating an open-source voice assistant project called RealChar, which eventually evolved into Rivia, a voice AI assistant focused on managing personal phone calls. The Future of AI and Consumer Empowerment // MLOps podcast #244 with S…
 
The conventional recipe for maximizing model accuracy is to (1) train multiple models with various hyperparameters and (2) pick the individual model which performs best on a held-out validation set, discarding the remainder. In this paper, we revisit the second step of this procedure in the context of fine-tuning large pre-trained models, where fin…
 
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
DiffusionPDE: Generative PDE-Solving Under Partial Observation
Aligning Diffusion Models with Noise-Conditioned Perception
Unlocking Continual Learning Abilities in Language Models…
 
There are two common ways in which developers are incorporating proprietary and domain-specific data when building applications of Large Language Models (LLMs): Retrieval-Augmented Generation (RAG) and Fine-Tuning. RAG augments the prompt with the external data, while fine-tuning incorporates the additional knowledge into the model itself. However,…
 
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
Evaluating D-MERIT of Partial-annotation on Information Retrieval
Long Context Transfer from Language to Vision…
 
In today’s episode, recorded with a live audience at TNW Conference 2024, Linnea and Andrii talk about post-quantum cryptography, competitive Excel, and a few more things in between. The guest of the show is Maria Amelie, CEO and founder of Factiverse. The company has just raised €1mn in funding to further build its platform that helps researchers,…
 
Software engineers are increasingly adding semantic search capabilities to applications using a strategy known as Retrieval Augmented Generation (RAG). A RAG system involves finding documents that semantically match a query and then passing the documents to a large language model (LLM) such as ChatGPT to extract the right answer using an LLM. RAG s…
 
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task
Towards Retrieval Augmented Generation over Large Video Libraries
Stylebreeder: Exploring …
 
Join us at our first in-person conference today all about AI Quality: https://www.aiqualityconference.com/ ML and AI as Distinct Control Systems in Heavy Industrial Settings // MLOps podcast #243 with Richard Howes, CTO of Metaformed. Richard Howes is a dedicated engineer who is passionate about control systems whether it be embedded systems, indust…
 

Quick Reference Guide