Artwork

Content provided by SD Times. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by SD Times or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

274: AI testing AI? A look at CriticGPT

15:08
 
Share
 

Manage episode 435230955 series 2591275
Content provided by SD Times. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by SD Times or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

In this episode, we speak with Rob Whiteley, CEO of Coder, about OpenAI's recent announcement of CriticGPT, a new AI model that provides critiques of ChatGPT responses in order to help the humans training GPT models better evaluate outputs during reinforcement learning from human feedback (RLFH). According to OpenAI, CriticGPT isn't perfect, but it does help trainers catch more problems than they do on their own.
Key talking points include:

  • The downsides of having AI testing the quality of other AI models
  • Why it's important to be specific about what types of errors the model is allowed to look for
  • Is this another example of rushing into AI?

  continue reading

273 episodes

Artwork
iconShare
 
Manage episode 435230955 series 2591275
Content provided by SD Times. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by SD Times or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

In this episode, we speak with Rob Whiteley, CEO of Coder, about OpenAI's recent announcement of CriticGPT, a new AI model that provides critiques of ChatGPT responses in order to help the humans training GPT models better evaluate outputs during reinforcement learning from human feedback (RLFH). According to OpenAI, CriticGPT isn't perfect, but it does help trainers catch more problems than they do on their own.
Key talking points include:

  • The downsides of having AI testing the quality of other AI models
  • Why it's important to be specific about what types of errors the model is allowed to look for
  • Is this another example of rushing into AI?

  continue reading

273 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide