菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-03
📄 Abstract - DP-SfM: Dual-Pixel Structure-from-Motion without Scale Ambiguity

Multi-view 3D reconstruction, namely, structure-from-motion followed by multi-view stereo, is a fundamental component of 3D computer vision. In general, multi-view 3D reconstruction suffers from an unknown scale ambiguity unless a reference object of known size is present in the scene. In this article, we show that multi-view images captured using a dual-pixel (DP) sensor can automatically resolve the scale ambiguity, without requiring a reference object or prior calibration. Specifically, the defocus blur observed in DP images provides sufficient information to determine the absolute scale when paired with depth maps (up to scale) recovered from multi-view 3D reconstruction. Based on this observation, we develop a simple yet effective linear method to estimate the absolute scale, followed by the intensity-based optimization stage that aligns the left and right DP images by shifting them back toward each other using cross-view blur kernels. Experiments demonstrate the effectiveness of the proposed approach across diverse scenes captured with different cameras and lenses. Code and data are available at this https URL

顶级标签: computer vision multi-modal systems
详细标签: structure-from-motion 3d reconstruction scale ambiguity dual-pixel sensor defocus blur 或 搜索:

DP-SfM:无需尺度模糊的双像素结构光运动重建 / DP-SfM: Dual-Pixel Structure-from-Motion without Scale Ambiguity


1️⃣ 一句话总结

这篇论文提出了一种新方法,利用双像素相机拍摄的多视角图像中的散焦模糊信息,在无需参照物或预先标定的情况下,自动消除三维重建中的未知尺度问题,从而得到物体的真实尺寸。

源自 arXiv: 2605.01852