AI RESEARCH

Fusion-fission forecasts when AI will shift to undesirable behavior

arXiv cs.AI

arXiv:2605.14218v1. The key problem facing the use of ChatGPT-like AI across society is that its behavior can shift, unnoticed, from desirable to undesirable (encouraging self-harm, extremist acts, financial losses, or costly medical and military mistakes), and no one can yet predict when. Such shifts persist in even the newest AI models despite remarkable progress in AI modeling, post-