AI RESEARCH
VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation
arXiv cs.LG
arXiv:2604.02580v1 Announce Type: new
Evaluating code generation models for 3D spatial reasoning requires executing generated code in realistic environments and assessing outputs beyond surface-level correctness. We