EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

ArXi:2602.09514v3 Announce Type: replace-cross Long-horizon planning is widely recognized as a core capability of autonomous LLM-based agents; however, current evaluation frameworks suffer from being largely episodic, domain-specific, or insufficiently grounded in persistent economic dynamics. We