AI RESEARCH

Geoparsing: Diagram Parsing for Plane and Solid Geometry with a Unified Formal Language

arXiv CS.CV

arXiv:2604.11600v1

Multimodal Large Language Models (MLLMs) have achieved remarkable progress but continue to struggle with geometric reasoning, primarily because of a perception bottleneck in recognizing fine-grained visual elements. While formal languages have aided plane geometry understanding, solid geometry, which requires spatial understanding, remains largely unexplored. In this paper, we address this challenge by designing a unified formal language that integrates plane and solid geometry, comprehensively covering geometric structures and semantic relations.