用于事件-帧非对称立体视觉的双向跨模态提示 / Bidirectional Cross-Modal Prompting for Event-Frame Asymmetric Stereo
1️⃣ 一句话总结
这篇论文提出了一种名为Bi-CMPStereo的新方法,通过双向跨模态提示,有效融合事件相机和传统帧相机的互补优势,显著提升了在快速运动和复杂光照条件下进行三维立体视觉匹配的准确性和泛化能力。
Conventional frame-based cameras capture rich contextual information but suffer from limited temporal resolution and motion blur in dynamic scenes. Event cameras offer an alternative visual representation with higher dynamic range free from such limitations. The complementary characteristics of the two modalities make event-frame asymmetric stereo promising for reliable 3D perception under fast motion and challenging illumination. However, the modality gap often leads to marginalization of domain-specific cues essential for cross-modal stereo matching. In this paper, we introduce Bi-CMPStereo, a novel bidirectional cross-modal prompting framework that fully exploits semantic and structural features from both domains for robust matching. Our approach learns finely aligned stereo representations within a target canonical space and integrates complementary representations by projecting each modality into both event and frame domains. Extensive experiments demonstrate that our approach significantly outperforms state-of-the-art methods in accuracy and generalization.
用于事件-帧非对称立体视觉的双向跨模态提示 / Bidirectional Cross-Modal Prompting for Event-Frame Asymmetric Stereo
这篇论文提出了一种名为Bi-CMPStereo的新方法,通过双向跨模态提示,有效融合事件相机和传统帧相机的互补优势,显著提升了在快速运动和复杂光照条件下进行三维立体视觉匹配的准确性和泛化能力。
源自 arXiv: 2604.15312