AI RESEARCH

Do Not Imitate, Reinforce: Iterative Classification via Belief Refinement

arXiv CS.LG

ArXi:2604.22110v1 Announce Type: new Standard supervised classification trains models to imitate the exact labels provided by a perfect oracle. This imitation happens in a single pass, restricting the model to a fixed compute budget even when inputs vary in complexity. Moreover, the rigid