NeVStereo:一种用于高保真3D任务的NeRF驱动的新视角合成-立体视觉架构 / NeVStereo: A NeRF-Driven NVS-Stereo Architecture for High-Fidelity 3D Tasks
1️⃣ 一句话总结
这篇论文提出了一个名为NeVStereo的新框架,它巧妙地将神经渲染和立体视觉技术结合起来,只用普通拍摄的彩色照片,就能同时高精度地计算出相机位置、深度图、生成新视角画面并重建出高质量的3D模型。
In modern dense 3D reconstruction, feed-forward systems (e.g., VGGT, pi3) focus on end-to-end matching and geometry prediction but do not explicitly output the novel view synthesis (NVS). Neural rendering-based approaches offer high-fidelity NVS and detailed geometry from posed images, yet they typically assume fixed camera poses and can be sensitive to pose errors. As a result, it remains non-trivial to obtain a single framework that can offer accurate poses, reliable depth, high-quality rendering, and accurate 3D surfaces from casually captured views. We present NeVStereo, a NeRF-driven NVS-stereo architecture that aims to jointly deliver camera poses, multi-view depth, novel view synthesis, and surface reconstruction from multi-view RGB-only inputs. NeVStereo combines NeRF-based NVS for stereo-friendly renderings, confidence-guided multi-view depth estimation, NeRF-coupled bundle adjustment for pose refinement, and an iterative refinement stage that updates both depth and the radiance field to improve geometric consistency. This design mitigated the common NeRF-based issues such as surface stacking, artifacts, and pose-depth coupling. Across indoor, outdoor, tabletop, and aerial benchmarks, our experiments indicate that NeVStereo achieves consistently strong zero-shot performance, with up to 36% lower depth error, 10.4% improved pose accuracy, 4.5% higher NVS fidelity, and state-of-the-art mesh quality (F1 91.93%, Chamfer 4.35 mm) compared to existing prestigious methods.
NeVStereo:一种用于高保真3D任务的NeRF驱动的新视角合成-立体视觉架构 / NeVStereo: A NeRF-Driven NVS-Stereo Architecture for High-Fidelity 3D Tasks
这篇论文提出了一个名为NeVStereo的新框架,它巧妙地将神经渲染和立体视觉技术结合起来,只用普通拍摄的彩色照片,就能同时高精度地计算出相机位置、深度图、生成新视角画面并重建出高质量的3D模型。
源自 arXiv: 2602.05423