AI SAFETY & ETHICS

Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation

Alignment Forum

AI model news: Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation. From Alignment Forum.