Content provided by Innodata. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Innodata or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

Michael Nguyen | The Inside Scoop on Ground Truth Data for ML

34:04

In this episode, Melody welcomes Michael Nguyen, VP of Global Data Practice and Partnerships at Innodata, Inc. Michael shares insights into all that AI is (and isn’t), the advances that have been made in ground truth data collection, and what it will take to overcome the biases that are still present in machine learning.

GROUND TRUTH DATA IN MACHINE LEARNING

Ground truth data is data that can’t be manufactured; it has to be captured. It is used primarily by companies that develop AI/ML products, and building a ground truth data set involves several critical considerations, including identifying what type of data you actually need. Michael points out data that usually isn’t readily available, such as human interaction data, and explains how his team overcomes the hurdles to secure it.

THE HUMAN SIDE OF AI

Some data are very easy to come by, while others are significantly more difficult to secure. Any data collection involving humans is more sensitive, and Michael highlights the ways his team addresses concerns and protects the privacy of anyone who shares data. Object and environment data collection is fairly straightforward, while speech, simulation, and human data are much more difficult to capture. In Michael’s view, the only way to mitigate bias in facial recognition is to capture as much data as possible.

THE EVOLUTION AND FUTURE OF AI

In the wake of the pandemic, healthcare has become a major focus for AI. From surgical robots to self-driving cars, AI will continue to improve quality of life and take on tasks that humans are not best suited to perform.

Michael’s Insights

“Basically anything you can think of, AI can be a part of it.” [3:50]

“When it comes to human interaction, it’s really hard to create synthetic data because everyone is different.” [11:24]

“I don’t think you can eliminate biases in AI, but what you can do is get as much data as possible.” [24:53]

“AI can help us do a lot of things we can’t do ourselves, especially in healthcare.” [29:18]

Michael’s Bio

Michael Nguyen is the VP of Global Data Practice and Partnerships at Innodata, Inc. He is an action-driven, technology-focused business development professional with over 20 years of experience delivering state-of-the-art products and services to global enterprises. For the past five years, Michael has focused on ground truth data for artificial intelligence and machine learning.

Links

Michael Nguyen LinkedIn

Innodata

17 episodes
