AI RESEARCH
GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning
arXiv CS.CV
•
ArXi:2603.22687v1 Announce Type: new Multimodal Large Language Models (MLLMs) have recently nstrated remarkable perceptual and reasoning abilities. However, they struggle to perceive fine-grained geometric structures, cons