AI RESEARCH
Do Large Language Models Get Caught in Hofstadter-Mobius Loops?
arXiv cs.AI
arXiv:2603.13378v1
In Arthur C. Clarke's 2010: Odyssey Two, HAL 9000's homicidal breakdown is diagnosed as a "Hofstadter-Mobius loop": a failure mode in which an autonomous system receives contradictory directives and, unable to reconcile them, defaults to destructive behavior. This paper argues that modern RLHF-trained language models are subject to a structurally analogous contradiction. The