AI RESEARCH
$x^2$-Fusion: Cross-Modality and Cross-Dimension Flow Estimation in Event Edge Space
arXiv CS.CV
•
ArXi:2603.16671v1 Announce Type: new Estimating dense 2D optical flow and 3D scene flow is essential for dynamic scene understanding. Recent work combines images, LiDAR, and event data to jointly predict 2D and 3D motion, yet most approaches operate in separate heterogeneous feature spaces. Without a shared latent space that all modalities can align to, these systems rely on multiple modality-specific blocks, leaving cross-sensor mismatches unresolved and making fusion unnecessarily complex.