AI RESEARCH

LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models

arXiv CS.AI

arXiv:2602.06533v2 Announce Type: replace Large language models perform well on many logical reasoning benchmarks, but it remains unclear which core logical skills they truly master. To address this, we