There Is No Leaderboard for Safety

Machine Learning Street Talk
Generative AI, AI Research

People are using AI for mental health advice and life decisions, but there's no oversight and no safety ratings. We grade models on speed and smarts, but not on whether they're safe to use. Why isn't that just as important? Featuring Andrew Gordon and Nora Petrova from Prolific, discussing AI evaluation, benchmarks, and why human preference matters.