Itzik Ben Shabat public
[search 0]
More
Download the App!
show episodes
 
🎙️ Welcome to the Talking Papers Podcast: Where Research Meets Conversation 🌟 Are you ready to explore the fascinating world of cutting-edge research in computer vision, machine learning, artificial intelligence, graphics, and beyond? Join us on this podcast by researchers, for researchers, as we venture into the heart of groundbreaking academic papers. At Talking Papers, we've reimagined the way research is shared. In each episode, we engage in insightful discussions with the main authors o ...
  continue reading
 
A tidal wave of computer vision innovation is quickly having an impact on everyone's lives, but not everyone has the time to sit down and read through a bunch of news articles and learn what it means for them. In Computer Vision Decoded, we sit down with Jared Heinly, the Chief Scientist at EveryPoint, to discuss topics in today’s quickly evolving world of computer vision and decode what they mean for you. If you want to be sure you understand everything happening in the world of computer vi ...
  continue reading
 
Loading …
show series
 
🎙️ **Unveiling 3DInAction with Yizhak Ben-Shabat | Talking Papers Podcast** 🎙️ 📚 *Title:* 3DInAction: Understanding Human Actions in 3D Point Clouds 📅 *Published In:* CVPR 2024 👤 *Guest:* Yizhak (Itzik) Ben-Shabat Welcome back to another exciting episode of the Talking Papers Podcast, where we bring you the latest breakthroughs in academic research…
  continue reading
 
Talking Papers Podcast Episode: "Cameras as Rays: Pose Estimation via Ray Diffusion" with Jason Zhang Welcome to the latest episode of the Talking Papers Podcast! This week's guest is Jason Zhang, a PhD student at the Robotics Institute at Carnegie Mellon University who joined us to discuss his paper, "Cameras as Rays: Pose Estimation via Ray Diffu…
  continue reading
 
Welcome to another exciting episode of the Talking Papers Podcast! In this episode, I had the pleasure of hosting Jiahao Li, a talented PhD student at Toyota Technological Institute at Chicago (TTIC), who discussed his groundbreaking research paper titled "Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model". This …
  continue reading
 
In this exciting episode of #TalkingPapersPodcast, we have the pleasure of hosting Ana Dodik, a second-year PhD student at MIT. We delve into her research paper titled "Variational Barycentric Coordinates." Published in SIGGRAPH Asia, 2023, this paper significantly contributes to our understanding of the optimization of generalized barycentric coor…
  continue reading
 
Welcome to another exciting episode of the Talking Papers Podcast! In this episode, we delve into the fascinating world of self-supervised learning with our special guest, Ravid Shwartz-Ziv. Together, we explore and dissect their research paper titled "Reverse Engineering Self-Supervised Learning," published in NeurIPS 2023. Self-supervised learnin…
  continue reading
 
Welcome to another exciting episode of the Talking Papers Podcast! In this installment, I had the pleasure of hosting the brilliant Zoë Marschner as we delved into the fascinating world of Constructive Solid Geometry on Neural Signed Distance Fields. This exceptional research paper, published in SIGGRAPH Asia 2023, explores the cutting-edge potenti…
  continue reading
 
🎙️Join us on this exciting episode of the Talking Papers Podcast as we sit down with the talented Sadegh Aliakbarian to explore his groundbreaking ICCV 2023 paper "HMD-NeMo: Online 3D Avatar Motion Generation From Sparse Observations" . Our guest, will take us on a journey through this pivotal research that addresses a crucial aspect of immersive m…
  continue reading
 
Join us on this exciting episode of the Talking Papers Podcast as we sit down with the brilliant Jeong Joon Park to explore his groundbreaking paper, "CC3D: Layout-Conditioned Generation of Compositional 3D Scenes," just published at ICCV 2023. Discover CC3D, a game-changing conditional generative model redefining 3D scene synthesis. Unlike traditi…
  continue reading
 
In this episode of Computer Vision Decoded, we are going to dive into our in-house computer vision expert's reaction to the iPhone 15 and iPhone 15 Pro announcement. We dive into the camera upgrades, decode what a quad sensor means, and even talk about the importance of depth maps. Episode timeline: 00:00 Intro 02:59 iPhone 15 Overview 05:15 iPhone…
  continue reading
 
Welcome to another exciting episode of the Talking Papers Podcast! In this installment, I had the pleasure of hosting Chengfenfg Xu to discuss his paper "NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection" which was published at ICCV2023. In recent times, NeRF has gained widespread prominence, and the fie…
  continue reading
 
Welcome to another exciting episode of the Talking Papers Podcast! In this installment, I had the pleasure of hosting Tomas Jakab to discuss his paper "MagicPony: Learning Articulated 3D Animals in the Wild" which was published at CVPR 2023. The motivation behind the MagicPony methodology stems from the challenge posed by the scarcity of labeled da…
  continue reading
 
All links are available in this blog post Welcome to another exciting episode of the Talking Papers Podcast! In this installment, I had the pleasure of hosting Shir Iluz to discuss her groundbreaking paper titled "Word-As-Image for Semantic Typography" which won the SIGGRAPH 2023 Honorable Mention award. This scientific paper introduces an innovati…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Yawar Siddiqui to chat about his CVPR 2023 paper "Panoptic Lifting for 3D Scene Understanding with Neural Fields". All links are available in the blog post. In this paper, they proposed a new method for "lifting" 2D panoptic segmentation into a 3D volume represented as neural fields using in-t…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Kejie Li to chat about his CVPR 2023 paper "MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices". All links are available in the blog post. In this paper, they proposed a new dataset and paradigm for evaluating 3D object reconstruction. It is very difficult to create a digital t…
  continue reading
 
All links are available in the blog post. In this episode of the Talking Papers Podcast, I hosted Jiahao Zhang to chat about our CVPR 2023 paper "Aligning Step-by-Step Instructional Diagrams to Video Demonstrations". furniture assembly diagram. To do that, we collected and annotated a brand new dataset: "IKEA Assembly in the Wild" where we aligned …
  continue reading
 
In this episode of Computer Vision Decoded, we are going to dive into Pierre Moulon's 10 years experience building OpenMVG. We also cover the impact of open-source software in the computer vision industry and everything involved in building your own project. There is a lot to learn here! Our episode guest, Pierre Moulon, is a computer vision resear…
  continue reading
 
In this episode of Computer Vision Decoded, we are going to dive into implicit neural representations. We are joined by Itzik Ben-Shabat, a Visiting Research Fellow at the Australian National Universit (ANU) and Technion – Israel Institute of Technology as well as the host of the Talking Paper Podcast. You will learn a core understanding of implici…
  continue reading
 
All links are available in the blog post: https://www.itzikbs.com/inr2vec/ In this episode of the Talking Papers Podcast, I hosted Luca De Luigi. We had a great chat about his paper “Deep Learning on Implicit Neural Representations of Shapes”, AKA INR2Vec, published in ICLR 2023 . In this paper, they take implicit neural representations to the next…
  continue reading
 
In this episode of Computer Vision Decoded, we are going to dive into 4 different ways to 3D reconstruct a scene with images. Our cohost Jared Heinly, a PhD in the computer science specializing in 3D reconstruction from images, will dive into the 4 distinct strategies and discuss the pros and cons of each. Links to content shared in this episode: L…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Yael Vinker. We had a great chat about her paper "CLIPasso: SEmantically-Aware Object Sketching”, SIGGRAPH 2022 best paper award winner. In this paper, they convert images into sketches with different levels of abstraction. They avoid the need for sketch datasets by using the well-known CLIP m…
  continue reading
 
Join our guest, Keith Ito, founder of Scaniverse as we discuss the challenges of creating a 3D capture app for iPhones. Keith goes into depth on balancing speed with quality of 3D output and how he designed an intuitive user experience for his users. In this episode, we discuss… 01:00 - Keith's Ito's background at Google 09:44 - What is the Scanive…
  continue reading
 
All links are available in the blog post. In this episode of the Talking Papers Podcast, we hosted Amir Belder. We had a great chat about his paper "Random Walks for Adversarial Meshes”, published in SIGGRAPH 2022. In this paper, they take on the task of creating an adversarial attack for triangle meshes. This is a non-trivial task since meshes are…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Silvia Sellán. We had a great chat about her paper "Stochastic Poisson Surface Reconstruction”, published in SIGGRAPH Asia 2022. In this paper, they take on the task of surface reconstruction with a probabilistic twist. They take the well-known Poisson Surface reconstruction algorithm and gene…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Sameera Ranasinghe. We had a great chat about his paper "Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs”, published in ECCV 2022 as an oral presentation. In this paper, they propose a new family of activation functions for coordinate MLPs and provide a theo…
  continue reading
 
In this episode of Computer Vision Decoded, we are going to dive into one of the hottest topics in the industry: Neural Radiance Fields (NeRFs) We are joined by Matt Tancik, a student pursuing a PhD in the computer science and electrical engineering department at UC Berkeley. He has also contributed research to the original NeRF project in 2020 alo…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Marko Mihajlovic . We had a great chat about his paper "KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints”, published in ECCV 2022. In this paper, they create a generalizable NeRF for virtual avatars. To get a high-fidelity reconstruction of…
  continue reading
 
In this episode of Computer Vision Decoded, we are going to dive into image capture best practices for 3D reconstruction. At the end of this livestream, you will have learned the basics for capturing scenes and objects. We will also provide a downloadable visual guide for reference on your next 3D reconstruction project. Download the official guide…
  continue reading
 
In this episode of Computer Vision Decoded, we join Jared Heinly and Jonathan Stephens from EveryPoint for their live reaction to the iPhone 14 series announcement. They go in depth into what all the camera specs mean to the average person. We also explain basics of computational photography and how Apple is able to get great photos from a small ca…
  continue reading
 
In this episode of Computer Vision Decoded, we sit down with Jared Heinly, Chief Scientist at EveryPoint, to discuss 3D reconstruction in the wild. What does “in the wild” mean? This means 3D reconstructing objects and scenes in non-controlled environments where you may have limitations with lighting, access, reflective surfaces, etc. 00:00 Intro 0…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted David B. Lindell to chat about his paper "BACON: Band-Limited Coordinate Networks for Multiscale Scene Representation”, published in CVPR 2022. In this paper, they took on training a coordinate network. They do this by introducing a new type of neural network architecture that has an analytica…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Hsueh-Ti Derek Liu to chat about his paper "Learning Smooth Neural Functions via Lipschitz Regularization”, published in SIGGRAPH 2022. In this paper, they took on the unique task of enforcing smoothness on Neural Fields (modelled as a neural network). They do this by introducing a regularizat…
  continue reading
 
In this episode of Computer Vision Decoded we dive into Jared Heinly's recent trip to the CVPR Conference. We cover: what the conference about, who should attend, what are the emerging trends in computer vision, how machine learning is being used in 3D reconstruction, and what NeRFs are for. 00:00 - Introduction 00:36 - What is CVPR? 02:49 - Who sh…
  continue reading
 
In this inaugural episode of Computer Vision Decoded we dive into the recent announcements at WWDC 2022 and find out what they mean for the computer vision community. We talk about what Apple is doing with their new RoomPlan API and how computer vision scientists can leverage it for better experiences. We also cover the enhancements to video and ph…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Chamin Hewa Koneputugodage to chat about OUR paper "DiGS: Divergence guided shape implicit neural representation for unoriented point clouds”, published in CVPR 2022. In this paper, we took on the task of surface reconstruction using a novel divergence-guided approach. Unlike previous methods,…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Dejan Azinović to chat about his paper "Neural RGB-D Surface Reconstruction”, published in CVPR 2022. In this paper, they take on the task of RGBD surface reconstruction by using novel view synthesis. They incorporate depth measurements into the radiance field formulation by learning a neural …
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Yuliang Xiu to chat about his paper "ICON: Implicit Clothed humans Obtained from Normals”, published in CVPR 2022. SMPL(-X) body model to infer clothed humans (conditioned on the normals). Additionally, they propose an inference-time feedback loop that alternates between refining the body's no…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Itai Lang to chat about his paper "SampleNet: Differentiable Point Cloud Sampling”, published in CVPR 2020. In this paper, they propose a point soft-projection to allow differentiating through the sampling operation and enable learning task-specific point sampling. Combined with their regulari…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Manuel Dahnert to chat about his paper “Panoptic 3D Scene Reconstruction From a Single RGB Image”, published in NeurIPS 2021. In this paper, they unify the task of reconstruction, semantic segmentation and instance segmentation in 3D from a single RGB image. They propose a holistic approach to…
  continue reading
 
In this episode of the Talking Papers Podcast, I hosted Songyou Peng to chat about his paper “Shape As Points: A Differentiable Poisson Solver”, published in NeurIPS 2021. In this paper, they take on the task of surface reconstruction and propose a hybrid representation that unifies explicit and implicit representation in addition to a differentiab…
  continue reading
 
PAPER TITLE: "VLN BERT: A Recurrent Vision-and-Language BERT for Navigation" AUTHORS: Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen Gould ABSTRACT: Accuracy of many visiolinguistic tasks has benefited significantly from the application of vision-and-language (V&L) BERT. However, its application for the task of vision and-languag…
  continue reading
 
PAPER TITLE Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks AUTHORS Despoina Paschalidou , Angelos Katharopoulos, Andreas Geiger, Sanja Fidler ABSTRACT Impressive progress in 3D shape extraction led to representations that can capture object geometries with high fidelity. In parallel, primitive-based methods …
  continue reading
 
PAPER TITLE: Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction AUTHORS: Guy Gafni Justus Thies Michael Zollhöfer Matthias Nießner Project page: https://gafniguy.github.io/4D-Facial-Avatars/ CODE: 💻https://github.com/gafniguy/4D-Facial-Avatars ABSTRACT: We present dynamic neural radiance fields for modeling the appearance …
  continue reading
 
PAPER TITLE: "UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders" AUTHORS: Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Sadat Saleh, Tong Zhang, Nick Barnes ABSTRACT: In this paper, we propose the first framework (UCNet) to employ uncertainty for RGB-D saliency detection by learning from th…
  continue reading
 
PAPER TITLE: "Deep Declarative Networks: a new hope" AUTHORS: Stephen Gould, Richard Hartley, Dylan Campbell ABSTRACT: We explore a new class of end-to-end learnable models wherein data processing nodes (or network layers) are defined in terms of desired behaviour rather than an explicit forward function. Specifically, the forward function is impli…
  continue reading
 
Paper title: "DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video" Authors: Cristian Rodriguez-Opazo, Edison Marrese-Taylor, Basura Fernando, Hongdong Li, Stephen Gould Abstract: This paper studies the task of temporal moment localization in a long untrimmed video using natural language query. Given…
  continue reading
 
Loading …

Quick Reference Guide