AI RESEARCH

PrepBench: How Far Are We from Natural-Language-Driven Data Preparation?

arXiv CS.AI

ArXi:2605.08687v1 Announce Type: cross Data preparation is a central and time-consuming stage in data analysis workflows. Traditionally, commercial tools have relied on graphical user interfaces (GUIs) to simplify data preparation, allowing users to define transformations through visual operators and workflows. Recent advances in large language models (LLMs) raise the possibility of a paradigm shift toward natural language (NL)-driven data preparation, in which users can specify preparation intents in NL directly.