AI RESEARCH

Discovering Reinforcement Learning Interfaces with Large Language Models

arXiv CS.LG

ArXi:2605.03408v1 Announce Type: new Reinforcement learning systems rely on environment interfaces that specify observations and reward functions, yet constructing these interfaces for new tasks often requires substantial manual effort. While recent work has automated reward design using large language models (LLMs), these approaches assume fixed observations and do not address the broader challenge of synthesizing complete task interfaces. We study RL task interface discovery from raw simulator state, where both observation mappings and reward functions must be generated.