📄 Abstract - Generative Video Motion Editing with 3D Point Tracks

Camera and object motions are central to a video's narrative. However, precisely editing these captured motions remains a significant challenge, especially in the presence of complex object movements. Current motion-controlled image-to-video (I2V) approaches often lack full-scene context for consistent video editing, while video-to-video (V2V) methods support viewpoint changes or basic object translation but offer limited control over fine-grained object motion. We present a track-conditioned V2V framework that enables joint editing of camera and object motion. We achieve this by conditioning a video generation model on a source video and paired 3D point tracks representing source and target motions. These 3D tracks establish sparse correspondences that transfer rich context from the source video to new motions while preserving spatiotemporal coherence. Crucially, compared to 2D tracks, 3D tracks provide explicit depth cues, allowing the model to resolve depth order and handle occlusions for precise motion editing. Trained in two stages on synthetic and real data, our model supports diverse motion edits, including joint camera/object manipulation, motion transfer, and non-rigid deformation, unlocking new creative potential in video editing.
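To make the conditioning idea concrete, below is a minimal sketch (not the paper's implementation) of how paired source/target 3D point tracks might be packed into a per-frame conditioning tensor for a track-conditioned V2V model. The function name `build_track_conditioning`, the tensor shapes, and the visibility flag are illustrative assumptions; the abstract does not specify the actual encoding or how the tracks are injected into the video generation model.

```python
# Hypothetical sketch: packing paired 3D point tracks into a conditioning
# tensor. Shapes and names are assumptions, not the paper's actual code.
import torch


def build_track_conditioning(src_tracks, tgt_tracks, visibility):
    """Concatenate source/target 3D tracks into a per-frame conditioning tensor.

    src_tracks, tgt_tracks: (T, N, 3) float tensors of 3D point positions
        (x, y, depth) for N tracked points over T frames.
    visibility: (T, N) bool tensor, False where a point is occluded.

    Returns a (T, N, 7) tensor: source xyz, target xyz, visibility flag.
    """
    vis = visibility.unsqueeze(-1).float()                    # (T, N, 1)
    cond = torch.cat([src_tracks, tgt_tracks, vis], dim=-1)   # (T, N, 7)
    return cond


if __name__ == "__main__":
    T, N = 16, 256                        # frames, tracked points (assumed)
    src = torch.randn(T, N, 3)            # placeholder source tracks
    tgt = src.clone()
    tgt[..., 0] += 0.1                    # toy edit: shift the motion along x
    vis = torch.ones(T, N, dtype=torch.bool)
    cond = build_track_conditioning(src, tgt, vis)
    print(cond.shape)                     # torch.Size([16, 256, 7])
    # `cond` would then be fed to a video generation model alongside the
    # source video, e.g. via cross-attention or channel concatenation
    # (the exact conditioning mechanism is not stated in the abstract).
```

The point of carrying an explicit depth channel per track, as the abstract argues, is that a downstream model can disambiguate which of two crossing points lies in front, something 2D tracks alone cannot resolve.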

Top-level tags: computer vision, video, aigc
Detailed tags: video editing, motion control, 3d point tracks, video-to-video, generative models

Generative Video Motion Editing with 3D Point Tracks


1️⃣ One-Sentence Summary

This paper proposes a new video editing method that uses 3D point tracks to precisely control complex camera and object motions in a video, addressing the difficulty existing methods have in maintaining scene consistency and handling fine-grained motion.

