AI RESEARCH

Self-Improvement for Fast, High-Quality Plan Generation

arXiv CS.AI

ArXi:2605.03625v1 Announce Type: new Generative models trained on synthetic plan data are a promising approach to generalized planning. Recent work has focused on finding any valid plan, rather than a high-quality solution. We address the challenge of producing high-quality plans, a computationally hard problem, in sub-exponential time. First, we nstrate that, given optimal data, a decoder-only transformer can generate high-quality plans for unseen problem instances. Second, we show how to self-improve an initial model trained on sub-optimal data.