AI RESEARCH

Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation

arXiv CS.LG

ArXi:2605.12741v1 Announce Type: new Enabling Large Language Models (LLMs) to continuously improve from environmental interactions is a central challenge in post-