AI RESEARCH

Value Under Ignorance in Universal Artificial Intelligence

arXiv CS.AI

ArXi:2512.17086v2 Announce Type: replace We generalize the AIXI reinforcement learning agent to admit a wider class of utility functions. Assigning a utility to each possible interaction history forces us to confront the ambiguity that some hypotheses in the agent's belief distribution only predict a finite prefix of the history, which is sometimes interpreted as implying a chance of death equal to a quantity called the semimeasure loss. This death interpretation suggests one way to assign utilities to such history prefixes.