📄 论文总结
RoMa v2:更强大、更优秀、更快速、更密集的特征匹配 / RoMa v2: Harder Better Faster Denser Feature Matching
1️⃣ 一句话总结
这篇论文提出了一种新的密集图像特征匹配模型,通过改进网络结构、训练策略和优化技术,在保持高精度的同时大幅提升了匹配速度和效率,适用于各种复杂场景。
Dense feature matching aims to estimate all correspondences between two images of a 3D scene and has recently been established as the gold-standard due to its high accuracy and robustness. However, existing dense matchers still fail or perform poorly for many hard real-world scenarios, and high-precision models are often slow, limiting their applicability. In this paper, we attack these weaknesses on a wide front through a series of systematic improvements that together yield a significantly better model. In particular, we construct a novel matching architecture and loss, which, combined with a curated diverse training distribution, enables our model to solve many complex matching tasks. We further make training faster through a decoupled two-stage matching-then-refinement pipeline, and at the same time, significantly reduce refinement memory usage through a custom CUDA kernel. Finally, we leverage the recent DINOv3 foundation model along with multiple other insights to make the model more robust and unbiased. In our extensive set of experiments we show that the resulting novel matcher sets a new state-of-the-art, being significantly more accurate than its predecessors. Code is available at this https URL
RoMa v2:更强大、更优秀、更快速、更密集的特征匹配 / RoMa v2: Harder Better Faster Denser Feature Matching
这篇论文提出了一种新的密集图像特征匹配模型,通过改进网络结构、训练策略和优化技术,在保持高精度的同时大幅提升了匹配速度和效率,适用于各种复杂场景。