菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-08
📄 Abstract - See More, Match Better: Multi-Source Feature Fusion for Two-View Correspondence Learning

Two-view correspondence learning aims to distinguish true correspondences (inliers) from false ones (outliers) in image pairs by leveraging their underlying differences. Existing methods mainly rely on coordinate-based geometric consistency. However, they often struggle with pseudo-consistent outliers in scenes containing repetitive structures, textureless regions, or locally similar geometric patterns. To address this limitation, we propose TriMatch, a multi-source feature fusion framework for two-view correspondence learning, which consists of two parts: feature extraction and feature refinement. In feature extraction, TriMatch jointly extracts geometric, texture semantic, and structural semantic features to provide complementary evidence for correspondence discrimination. To bridge the gap between semantic and geometric features, texture and structural semantic features are aligned with geometric features through dedicated Texture-Geometric Alignment and Structural-Geometric Alignment modules, respectively. We further introduce a Semantic-Guided Correspondence Modulation module, which modulates geometric features using semantic information to suppress geometrically plausible but semantically inconsistent correspondences. In feature refinement, a Hierarchical Semantic-Enhanced Correspondence Refinement strategy progressively models correspondence dependencies and recalibrates multi-context feature responses, enabling more reliable inlier-outlier discrimination. Extensive experiments demonstrate the effectiveness, robustness, and generalization capability of TriMatch.

顶级标签: computer vision machine learning
详细标签: correspondence learning feature fusion geometric consistency semantic alignment inlier detection 或 搜索:

看得更多,匹配更准:面向双视图对应学习的多源特征融合方法 / See More, Match Better: Multi-Source Feature Fusion for Two-View Correspondence Learning


1️⃣ 一句话总结

本文提出了一种名为TriMatch的多源特征融合框架,通过同时利用几何、纹理和结构三种特征信息,有效解决了重复纹理或无纹理区域中误匹配点难以识别的问题,显著提升了图像间正确匹配点的判别能力。

源自 arXiv: 2606.09262