AI RESEARCH

LiveWeb-IE: A Benchmark For Online Web Information Extraction

arXiv CS.CL

ArXi:2603.13773v1 Announce Type: new Web information extraction (WIE) is the task of automatically extracting data from web pages, offering high utility for various applications. The evaluation of WIE systems has traditionally relied on benchmarks built from HTML snapshots captured at a single point in time. However, this offline evaluation paradigm fails to account for the temporally evolving nature of the web; consequently, performance on these static benchmarks often fails to generalize to dynamic real-world scenarios. To bridge this gap, we