arXiv submission date: 2026-01-14
📄 Abstract - V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Powerful 3D representations such as DUSt3R's invariant point maps, which encode 3D shape and camera parameters, have significantly advanced feed-forward 3D reconstruction. While point maps assume static scenes, Dynamic Point Maps (DPMs) extend this concept to dynamic 3D content by additionally representing scene motion. However, existing DPMs are limited to image pairs and, like DUSt3R, require post-processing via optimization when more than two views are involved. We argue that DPMs are more useful when applied to videos and introduce V-DPM to demonstrate this. First, we show how to formulate DPMs for video input in a way that maximizes representational power, facilitates neural prediction, and enables reuse of pretrained models. Second, we implement these ideas on top of VGGT, a recent and powerful 3D reconstructor. Although VGGT was trained on static scenes, we show that a modest amount of synthetic data is sufficient to adapt it into an effective V-DPM predictor. Our approach achieves state-of-the-art performance in 3D and 4D reconstruction for dynamic scenes. In particular, unlike recent dynamic extensions of VGGT such as P3, DPMs recover not only dynamic depth but also the full 3D motion of every point in the scene.
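To make the core idea concrete: a point map assigns each pixel a 3D point, while a dynamic point map additionally records where that point lies at another timestamp, so per-pixel 3D scene flow falls out as a simple difference. The sketch below is illustrative only (not the paper's code); the array shapes and the uniform translation are assumptions for the example.

```python
import numpy as np

# Illustrative sketch of the point-map idea (not the paper's implementation).
# A point map stores one 3D point per pixel, shape (H, W, 3), in a common
# reference frame. A dynamic point map (DPM) pairs it with the positions of
# the same points at a second timestamp, so subtracting the two maps yields
# the full 3D motion (scene flow) of every pixel.

H, W = 4, 6  # tiny image resolution, chosen for illustration

rng = np.random.default_rng(0)
pointmap_t0 = rng.standard_normal((H, W, 3))            # scene geometry at t0
pointmap_t1 = pointmap_t0 + np.array([0.1, 0.0, 0.0])   # same points at t1,
                                                        # shifted along x (assumed motion)

scene_flow = pointmap_t1 - pointmap_t0  # per-pixel 3D motion vector

assert scene_flow.shape == (H, W, 3)
print(np.allclose(scene_flow[..., 0], 0.1))  # → True: uniform x-translation
```

A depth-only dynamic reconstructor would recover only the z-component of each point over time; the DPM formulation keeps all three motion components, which is the distinction the abstract draws against P3.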

Top-level tags: computer vision, multi-modal, video
Detailed tags: 4D reconstruction, dynamic scenes, point maps, video understanding, 3D motion

V-DPM: 4D Video Reconstruction with Dynamic Point Maps


1️⃣ One-sentence summary

This paper proposes a new method called V-DPM that extends static-scene 3D reconstruction techniques to video. It not only reconstructs the 3D shape of dynamic scenes from video but also tracks the complete 3D motion trajectory of every point in the scene, achieving more capable 4D (3D + time) reconstruction of dynamic scenes.

Source: arXiv 2601.09499