AI RESEARCH

MLG-Stereo: ViT Based Stereo Matching with Multi-Stage Local-Global Enhancement

arXiv cs.CV

arXiv:2604.20393v1 Announce Type: new

With the development of deep learning, ViT-based stereo matching methods have made significant progress, owing to their robustness and zero-shot ability. However, because ViTs are sensitive to input resolution and relatively neglect local information, ViT-based methods still predict fine details and handle arbitrary-resolution images less well than CNN-based methods.