AI RESEARCH

SQLyzr: A Comprehensive Benchmark and Evaluation Platform for Text-to-SQL

arXiv CS.AI

ArXi:2604.21214v1 Announce Type: cross Text-to-SQL models have significantly improved with the adoption of Large Language Models (LLMs), leading to their increasing use in real-world applications. Although many benchmarks exist for evaluating the performance of text-to-SQL models, they often rely on a single aggregate score, lack evaluation under realistic settings, and provide limited insight into model behaviour across different query types. In this work, we present SQLyzr, a comprehensive benchmark and evaluation platform for text-to-SQL models.