AI RESEARCH

Peer-Predictive Self-Training for Language Model Reasoning

arXiv CS.AI

ArXi:2604.13356v1 Announce Type: cross Mechanisms for continued self-improvement of language models without external supervision remain an open challenge. We propose Peer-Predictive Self-