AI RESEARCH
Agent-Driven Corpus Linguistics: A Framework for Autonomous Linguistic Discovery
arXiv CS.CL
•
ArXi:2604.07189v1 Announce Type: new Corpus linguistics has traditionally relied on human researchers to formulate hypotheses, construct queries, and interpret results - a process demanding specialized technical skills and considerable time. We propose Agent-Driven Corpus Linguistics, an approach in which a large language model (LLM), connected to a corpus query engine via a structured tool-use interface, takes over the investigative cycle: generating hypotheses, querying the corpus, interpreting results, and refining analysis across multiple rounds.