AI SAFETY & ETHICS

Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability

Alignment Forum

AI news: Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability. From Alignment Forum.