Herculean: An Agentic Benchmark for Financial Intelligence

arXiv:2605.14355v1 Announce Type: cross As AI agents improve, the central question is no longer whether they can solve isolated, well-defined financial tasks, but whether they can reliably carry out professional financial work. Existing financial benchmarks offer only a partial view of this ability, as they primarily evaluate static competencies such as question answering, retrieval, summarization, and classification. We