@Computerphile
Sleeper Agents in Large Language Models - Computerphile
It's an older paper, but it checks out. Rob Miles discusses the problem of 'Sleeper Agents' - where LLMs could have hidden traits we don't know about until it's too late.
Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: https://jane-st.co/computerphile
This video was filmed and edited by Sean Riley.
Computerphile is a sister project to Brady Haran's Numberphile. More at https://www.bradyharanblog.com
Post from Computerphile on September 12, 2025
Comments 0
No comments yet. Be the first to comment!