Best Marcus Edel Podcasts (2024)

October 27th, 2023 - AI Unleashed: Decoding Sycophancy, Mastering Control, and Crafting 3D Realities 8:32

Towards Understanding Sycophancy in Language Models; Controlled Decoding from Language Models; HyperFields: Towards Zero-Shot Generation of NeRFs from Text. By Marcus Edel

October 26th, 2023 - Frontiers of AI: From Quantum Compression to Visionary Transformers 14:15

LLM-FP4: 4-Bit Floating-Point Quantized Transformers; Detecting Pretraining Data from Large Language Models; ConvNets Match Vision Transformers at Scale; A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation; QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models

October 25th, 2023 - Pixel to Perception: Matryoshka Synthesis, GPT-3's Linguistic Mysteries, Woodpecker's Visual Refinement, and SAM-CLIP's Vision Evolution 11:12

Matryoshka Diffusion Models; Dissecting In-Context Learning of Translations in GPTs; Woodpecker: Hallucination Correction for Multimodal Large Language Models; SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding. By Marcus Edel

October 24th, 2023 - Neural Visions Unveiled: From FreeNoise's Video Clarity, HallusionBench's Reality Check, to FlashEdit's Instant Image Refinements 6:35

FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling; HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models; Localizing and Editing Knowledge in Text-to-Image Generative Models

October 23rd, 2023 - Unlocking AI's Potential: From Open Waters to Self-Enhancing Miniature Models 6:35

H2O Open Ecosystem for State-of-the-art Large Language Models; Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models; Teaching Language Models to Self-Improve through Interactive Demonstrations. By Marcus Edel

October 4th, 2023 - NeuroFrontiers: Pensive Processors, Natural Evolution, and the New Age of Linguistic Titans 13:09

Think before you speak: Training Language Models With Pause Tokens; Towards Self-Assembling Artificial Neural Networks through Neural Developmental Programs; Efficient Streaming Language Models with Attention Sinks; Large Language Models Cannot Self-Correct Reasoning Yet; SmartPlay: A Benchmark for LLMs as Intelligent Agents

October 3rd, 2023 - Evolution in Text: Self-Improvement, Synthesis, and Scrutiny 7:51

Enable Language Models to Implicitly Learn Self-Improvement From Data; PixArt-alpha: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis; FELM: Benchmarking Factuality Evaluation of Large Language Models. By Marcus Edel

October 2nd, 2023 - Math to Motion: ToRA, Decaf, and DRaFT Transformations 6:52

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving; Decaf: Monocular Deformation Capture for Face and Hand Interactions; Directly Fine-Tuning Diffusion Models on Differentiable Rewards. By Marcus Edel

September 29th, 2023 - Masters of AI Metamorphosis: From Long-Context Linguistics to 3D Dreamscapes 16:14

Effective Long-Context Scaling of Foundation Models; Demystifying CLIP Data; Vision Transformers Need Registers; Qwen Technical Report; DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation. By Marcus Edel

September 28th, 2023 - Neural Vistas & Visual Alchemy: From NeuRBF Reconstructions to ScalarSimplicity in AI Imagery 8:49

NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions; Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack; Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation; Finite Scalar Quantization: VQ-VAE Made Simple. By Marcus Edel

September 27th, 2023 - Beyond Boundaries: Pioneering Sequences, Alignments, and Realism in AI Evolution 6:30

DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models; Aligning Large Multimodal Models with Factually Augmented RLHF; LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models. By Marcus Edel

September 25th, 2023 - From Pixels to Precedents: Pioneering Visions in Color, Law, Code, and Sight 10:47

CoRF: Colorizing Radiance Fields using Knowledge Distillation; The Cambridge Law Corpus: A Corpus for Legal AI Research; CodePlan: Repository-level Coding using LLMs and Planning; DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion. By Marcus Edel

September 22nd, 2023 - Revolutionary Speeds & Precision: The Future of Neural Networks and Language Models 13:54

Parallelizing non-linear sequential models over the sequence length; Fast Feedforward Networks; LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models; A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models; Boolformer: Symbolic Regression of Logic Functions with Transformers

September 21st, 2023 - Neural Frontiers: From FreeU's Image Mastery to Languini Kitchen's Equalized Research 14:32

FreeU: Free Lunch in Diffusion U-Net; Neurons in Large Language Models: Dead, N-gram, Positional; DreamLLM: Synergistic Multimodal Comprehension and Creation; Kosmos-2.5: A Multimodal Literate Model; End-to-End Speech Recognition Contextualization with Large Language Models; The Languini Kitchen: Enabling Language Modelling Research at Different Scales …

September 20th, 2023 - From Overthinking Graphs to Code Whispering and Polyglot AI: The New Frontiers of Neural Networks, Language Models, and Data Compression 13:20

Graph Neural Networks Use Graphs When They Shouldn't; Large Language Models for Compiler Optimization; OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch; Baichuan 2: Open Large-scale Language Models; Language Modeling Is Compression; FoleyGen: Visually-Guided Audio Generation

September 12th, 2023 - Frontiers in AI: From Pint-Sized Powerhouses and Pruned Datasets to Multilingual Mastery and Image Restoration 11:11

Textbooks Are All You Need II: phi-1.5 technical report; DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior; When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale; MADLAD-400: A Multilingual And Document-Level Large Audited Dataset; FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning; Optimize …

September 11th, 2023 - Neural Frontiers: Audiobooks, Virtual Cities, Summarization, and Vision Transformers Reimagined 9:26

Large-Scale Automatic Audiobook Creation; CityDreamer: Compositional Generative Model of Unbounded 3D Cities; From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting; Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts; High-Quality Entity Segmentation

September 8th, 2023 - Unlocking the Future of AI: From Master Optimizers and Budget-Friendly Giants to Truthful Decoding and Video Segmentation Breakthroughs 11:28

Large Language Models as Optimizers; FLM-101B: An Open LLM and How to Train It with $100K Budget; XGen-7B Technical Report; Tracking Anything with Decoupled Video Segmentation; DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models. By Marcus Edel

September 7th, 2023 - SLiMe, Matcha-TTS, RoboSense, and CM3Leon: Revolutionizing Vision, Speech, and Multi-Modal Intelligence for a Smarter, Faster Future 8:11

SLiMe: Segment Like Me; Matcha-TTS: A fast TTS architecture with conditional flow matching; Physically Grounded Vision-Language Models for Robotic Manipulation; Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning. By Marcus Edel

September 6th, 2023 - Unlocking the Future of AI: Lean Transformers, Memory-Efficient RLHF, Voice-Altering Text Prompts, and 3D Virtual Humans 8:02

One Wide Feedforward is All You Need; Efficient RLHF: Reducing the Memory Usage of PPO; PromptTTS 2: Describing and Generating Voices with Text Prompt; AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections. By Marcus Edel

September 5th, 2023 - Frontiers in AI Efficiency and Capability: From Turbocharged Transformers and Extended Contexts to High-Definition Video Generation and Self-Tuned Learning 9:59

Fast Inference from Transformers via Speculative Decoding; YaRN: Efficient Context Window Extension of Large Language Models; VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation; RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

September 1st, 2023 - Unlocking Multilingual AI & Beyond: Innovations in Data, Synthesis, Bioinformatics, and 3D Creation 10:46

The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants; Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images; BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge; MVDream: Multi-view Diffusion for 3D Generation; Can Programming Languages Boost Each …

August 31st, 2023 - Advancing Weather Forecasts, Robotic Learning, and AI Conversations: A Trilogy of Innovation 6:44

WeatherBench 2: A benchmark for the next generation of data-driven global weather models; RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation; LLaSM: Large Language and Speech Model. By Marcus Edel

August 30th, 2023 - From AI Planners to 3D Faces: Groundbreaking Innovations in Machine Learning and Digital Media 7:55

Reward-Respecting Subtasks for Model-Based Reinforcement Learning; Relightify: Relightable 3D Faces from a Single Image via Diffusion Models; MagicEdit: High-Fidelity and Temporally Coherent Video Editing. By Marcus Edel

August 29th, 2023 - Concept Dissection, Alignment Pitfalls, and Responsible AI: Pioneering Approaches in Image Generation, Language Modeling, and Healthcare 8:53

Break-A-Scene: Extracting Multiple Concepts from a Single Image; The Poison of Alignment; MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records; ORES: Open-vocabulary Responsible Visual Synthesis. By Marcus Edel

August 28th, 2023 - Unified Ingenuity: Trailblazing Techniques Across NLP, ML, Optical Text Recognition, and Computer Graphics 8:49

PMET: Precise Model Editing in a Transformer; Interpretable Graph Neural Networks for Tabular Data; Nougat: Neural Optical Understanding for Academic Documents; Relighting Neural Radiance Fields with Shadow and Highlight Hints. By Marcus Edel

August 25th, 2023 - Harmonizing Audio, Privacy, and Visuals: WavJourney, Differentially Private Diffusion, and Diff2Lip Unveil the Future of Content Creation 7:16

WavJourney: Compositional Audio Creation with Large Language Models; Differentially Private Diffusion Models; Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization. By Marcus Edel

August 24th, 2023 - Revolutionizing Pixels and Prose: Breakthroughs in Diffusion Models, Multimodal Language Learning, and Media Editing 8:09

Scalable Diffusion Models with Transformers; BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions; StableVideo: Text-driven Consistency-aware Diffusion Video Editing; Exploiting Diffusion Prior for Real-World Image Super-Resolution. By Marcus Edel

August 23rd, 2023 - Semantic Symphony: Aligning Multilingual Text, Multimodal Translation, and Image Relation Inversion in the Era of AI 7:57

Extrapolating Large Language Models to Non-English by Aligning Languages; SeamlessM4T: Massively Multilingual & Multimodal Machine Translation; ReVersion: Diffusion-Based Relation Inversion from Images. By Marcus Edel

August 22nd, 2023 - Unveiling the Future of AI: From Chessboard Diversity to Text Detection Triumphs 11:16

Diversifying AI: Towards Creative Chess with AlphaZero; Graph of Thoughts: Solving Elaborate Problems with Large Language Models; Dataset Quantization; We Don't Need No Adam, All We Need Is EVE: On The Variance of Dual Learning Rate And Beyond; SRFormer: Empowering Regression-Based Text Detection Transformer with Segmentation

August 21st, 2023 - Beyond Binary: The Evolution of Language, Robotics, and Consciousness in AI 11:28

Reinforced Self-Training (ReST) for Language Modeling; Large Language Models as General Pattern Machines; Anaphoric Structure Emerges Between Neural Networks; Consciousness in Artificial Intelligence: Insights from the Science of Consciousness. By Marcus Edel

August 18th, 2023 - A Dive into the Future of Machine Learning: 3D Visions, Neural Evolutions, and Speedy Optimizations 6:52

Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis; Self Expanding Neural Networks; Fast as CHITA: Neural Network Pruning with Combinatorial Optimization. By Marcus Edel

August 17th, 2023 - AI Chronicles: From Carbon Footprints to Cinematic Magic 7:34

Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model; Teach LLMs to Personalize -- An Approach inspired by Writing Education; DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory; Dual-Stream Diffusion Net for Text-to-Video Generation

August 16th, 2023 - Navigating the Nexus of Neural Networks: From Bayesian Flows to Math Marvels and the Dawn of RAVEN 8:34

Bayesian Flow Networks; Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification; RAVEN: In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models. By Marcus Edel

August 15th, 2023 - Echoes of Innovation: From Crystal-Clear Speech to AI's Coding Mastery 8:04

SpeechX: Neural Codec Language Model as a Versatile Speech Transformer; Platypus: Quick, Cheap, and Powerful Refinement of LLMs; RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs; OctoPack: Instruction Tuning Code Large Language Models. By Marcus Edel

August 14th, 2023 - Conversations, Compositions, and Clarity: The Triad of AI Evolution 5:31

PIPPA: A Partially Synthetic Conversational Dataset; Composable Function-preserving Expansions for Transformer Architectures; Self-Alignment with Instruction Backtranslation. By Marcus Edel

August 11th, 2023 - Trailblazing Tech: From Robotic Pursuits to Sonic Symphonies and Alexa's Virtual Playground 5:37

Follow Anything: Open-set detection, tracking, and following in real-time; AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining; Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI. By Marcus Edel

August 10th, 2023 - Tech Tapestry: Weaving the Future of Collaboration, Driving, Language, and Melody 8:28

MetaGPT: Meta Programming for Multi-Agent Collaborative Framework; FocalFormer3D: Focusing on Hard Instance for 3D Object Detection; Shepherd: A Critic for Language Model Generation; JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models. By Marcus Edel

August 9th, 2023 - Soundscapes to Synthesis: Today's Pioneering Journeys in Machine Learning 11:56

Separate Anything You Describe; Pre-Trained Large Language Models for Industrial Control; ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation; Simple synthetic data reduces sycophancy in large language models; 3D Gaussian Splatting for Real-Time Radiance Field Rendering

August 8th, 2023 - AI Digest: Soundwaves, StarCraft, and Syntheses Unveiled 11:28

A Practical Deep Learning-Based Acoustic Side Channel Attack on Keyboards; AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning; From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion; FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search; SynJax: Structured Probability Distributions for JAX

August 7th, 2023 - AI Chronicles: From Vision-Language Fusion to Revolutionizing Clinical Trials 7:00

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities; Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization; Scaling Clinical Trial Matching Using Large Language Models: A Case Study in Oncology. By Marcus Edel

August 4th, 2023 - Neural Nexus: Traversing the AI Tapestry of Technological Triumphs 16:15

RWKV: Reinventing RNNs for the Transformer Era; DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales; OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models; Scaling Relationship on Learning Mathematical Reasoning with Large Language Models; The All-Seeing Project: Towards…

August 2nd, 2023 - Journey Through Neural Nexus: Decoding Tomorrow's Machine Learning Marvels 7:53

Predicting masked tokens in stochastic locations improves masked image modeling; Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models; Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models; WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings

August 1st, 2023 - Adventures in Machine Learning: Unraveling the Power of AI from Hydra Effects to Robotic Minds 12:55

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs; Discovering Adaptable Symbolic Algorithms from Scratch; RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control; Guiding Image Captioning Models Toward More Specific Captions; LLM-Rec: Personalized Recommendation via Prompting Large Language Models; The H…

July 31st, 2023 - Unveiling the AI Vanguard: From Reinforcement Learning Vulnerabilities to Multimodal Medical Marvels and Text-Driven Image Transformations 7:40

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback; Med-Flamingo: a Multimodal Medical Few-shot Learner; PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization. By Marcus Edel

July 28th, 2023 - AI Odyssey: Trailblazing the Future of Machine Learning 10:34

Scaling TransNormer to 175 Billion Parameters; PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback; Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models; To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation; How to Scale Your EMA

July 27th, 2023 - From Pixels to Policies: Today's Deep Dive into Machine Learning Wonders 8:34

Tracking Anything in High Quality; Towards Generalist Biomedical AI; Foundation Models and Fair Use; Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation. By Marcus Edel

July 25th, 2023 - Byte-Sized Brilliance: Decoding the Epochs of ML Evolution 12:00

The case for 4-bit precision: k-bit Inference Scaling Laws; No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models; PUMA: Secure Inference of LLaMA-7B in Five Minutes; Optimized Network Architectures for Large Language Model Training with Billions of Parameters; A Real-World WebAgent with Planning, Long Context…

July 24th, 2023 - AI Horizons: From Wordy Worlds to Virtual Visions 11:02

How will Language Modelers like ChatGPT Affect Occupations and Industries?; CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields; STEVE-1: A Generative Model for Text-to-Behavior in Minecraft; StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces; Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-t…

July 21st, 2023 - Neural Narratives: Chronicles of the AI Frontier 10:24

Meta-Transformer: A Unified Framework for Multimodal Learning; Divide & Bind Your Attention for Improved Generative Semantic Nursing; Brain2Music: Reconstructing Music from Human Brain Activity; FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets; Instruction-following Evaluation through Verbalizer Manipulation
