PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction

ArXi:2512.10888v2 Announce Type: replace Table extraction (TE) is a key challenge in visual document understanding. Traditional approaches detect tables first, then recognize their structure. Recently, interest has surged in developing methods, such as vision-language models (VLMs), that can extract tables directly in their full page or document context. However, progress has been difficult to nstrate due to a lack of annotated data. To address this, we create a new large-scale dataset, PubTables-v2. PubTables-v2 s a number of challenging table extraction tasks.