AI RESEARCH

BLAST: Benchmarking LLMs with ASP-based Structured Testing

arXiv cs.AI

arXiv:2604.22306v1 Announce Type: cross Large Language Models (LLMs) have demonstrated remarkable performance across a broad spectrum of tasks, including natural language understanding, dialogue systems, and code generation. Despite this evident progress, less attention has been paid to date to their effectiveness in handling declarative paradigms such as Answer Set Programming (ASP). In this paper we