AI RESEARCH
Evaluating Language Models for Harmful Manipulation
arXiv CS.AI
arXiv:2603.25326v1 Announce Type: new Interest in the concept of AI-driven harmful manipulation is growing, yet current approaches to evaluating it are limited. This paper