AI RESEARCH
Fusion-fission forecasts when AI will shift to undesirable behavior
arXiv CS.AI
arXiv:2605.14218v1 Announce Type: new The key problem facing ChatGPT-like AI's use across society is that its behavior can shift, unnoticed, from desirable to undesirable -- encouraging self-harm, extremist acts, financial losses, or costly medical and military mistakes -- and no one can yet predict when. Shifts persist in even the newest AI models despite remarkable progress in AI modeling, post-