AI RESEARCH

BrowseComp: a benchmark for browsing agents

OpenAI Blog

BrowseComp: a benchmark for browsing agents.