The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

arXiv:2603.15563v1 Announce Type: cross We present the PokeAgent Challenge, a large-scale benchmark for decision-making research built on Pokemon's multi-agent battle system and expansive role-playing game (RPG) environment. Partial observability, game-theoretic reasoning, and long-horizon planning remain open problems for frontier AI, yet few benchmarks stress all three simultaneously under realistic conditions.