AI RESEARCH

Benchmarking Large Language Models on Reference Extraction and Parsing in the Social Sciences and Humanities

arXiv CS.AI

ArXi:2603.13651v1 Announce Type: cross Bibliographic reference extraction and parsing are foundational for citation indexing, linking, and downstream scholarly knowledge-graph construction. However, most established evaluations focus on clean, English, end-of-document bibliographies, and. therefore. underrepresent the Social Sciences and Humanities (SSH), where citations are frequently multilingual, embedded in footnotes, abbreviated, and shaped by heterogeneous historical conventions.