LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding
arXiv CS.CV
arXiv:2412.17635v3 Announce Type: replace
Applying Gaussian Splatting to perception tasks for 3D scene understanding is becoming increasingly popular. Most existing work focuses primarily on rendering 2D feature maps from novel viewpoints, which yields an imprecise 3D language field with outlier languages and ultimately fails to align objects in 3D space. Because these approaches extract features from masked images, they also lack essential contextual information, which leads to inaccurate feature representations.