AI RESEARCH
Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
arXiv CS.AI
•
ArXi:2603.11640v1 Announce Type: cross Architectural floor plan design demands joint reasoning over geometry, semantics, and spatial hierarchy, which remains a major challenge for current AI systems. Although recent diffusion and language models improve visual fidelity, they still struggle with coherent spatial reasoning and controllable generation. We present HouseMind, a multimodal large language model that unifies floor plan understanding, generation, and editing in one framework. We