LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation

ArXi:2605.01394v1 Announce Type: cross Formal specification is essential for rigorous program verification, yet writing correct specifications remains costly and difficult to automate. Although large language models (LLMs) and agents have shown promising progress, their true capabilities and failure modes remain unclear. We present the first systematic and contamination-aware study of LLM- and agent-based formal specification generation for C programs. We