Prompt Refusal

Data Skeptic

Content provided by Kyle Polich.

2+ y ago 44:18

The creators of large language models restrict some of the types of requests one might make of them. LLMs commonly refuse to give advice on committing crimes, to produce adult content, or to respond with any details about a variety of sensitive subjects. As with any content filtering system, there are false positives and false negatives.

Today's interview with Max Reuter and William Schulze discusses their paper "I'm Afraid I Can't Do That: Predicting Prompt Refusal in Black-Box Generative Language Models". In this work, they explore what types of prompts get refused and build a machine learning classifier adept at predicting whether a particular prompt will be refused.

599 episodes

#Data Science #Datamining #Datascience #Machinelearning #Statistics #Kyle Polich #Science #Math #Tech #Skeptic

3,171 subscribers
