AI RESEARCH
HCT-QA: A Benchmark for Question Answering on Human-Centric Tables
arXiv CS.AI
arXiv:2504.20047v3
Tabular data embedded in PDF files, web pages, and other types of documents is prevalent across many domains. These tables, which we call human-centric tables (HCTs for short), are dense in information but often exhibit complex structural and semantic layouts. To query HCTs, some existing solutions focus on transforming them into relational formats. However, these solutions fail to handle the diverse and complex layouts of HCTs, which leaves the tables poorly suited to querying with SQL-based approaches.