AI RESEARCH
LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models
arXiv cs.AI
arXiv:2602.06533v2 Announce Type: replace Large language models perform well on many logical reasoning benchmarks, but it remains unclear which core logical skills they truly master. To address this, we