AI RESEARCH
Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation
arXiv CS.LG
•
ArXi:2605.12741v1 Announce Type: new Enabling Large Language Models (LLMs) to continuously improve from environmental interactions is a central challenge in post-