Content provided by Researchers across the Microsoft research community. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Researchers across the Microsoft research community or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

Abstracts: December 6, 2023

12:25
 

Members of the research community at Microsoft work continuously to advance their respective fields. Abstracts brings its audience to the cutting edge with them through short, compelling conversations about new and noteworthy achievements.

In this episode, Xing Xie, a Senior Principal Research Manager at Microsoft Research Asia, joins host Dr. Gretchen Huizinga to discuss “Evaluating General-Purpose AI with Psychometrics.” As AI capabilities move from task-specific to more general purpose, the paper explores psychometrics, a subfield of psychology, as an alternative to traditional methods for evaluating model performance and for supporting consistent and reliable systems.

Read the paper: Evaluating General-Purpose AI with Psychometrics

194 episodes
