← 返回列表

arXiv 提交日期: 2025-12-17

📄 Abstract - Spatia: Video Generation with Updatable Spatial Memory

Existing video generation models struggle to maintain long-term spatial and temporal consistency due to the dense, high-dimensional nature of video signals. To overcome this limitation, we propose Spatia, a spatial memory-aware video generation framework that explicitly preserves a 3D scene point cloud as persistent spatial memory. Spatia iteratively generates video clips conditioned on this spatial memory and continuously updates it through visual SLAM. This dynamic-static disentanglement design enhances spatial consistency throughout the generation process while preserving the model's ability to produce realistic dynamic entities. Furthermore, Spatia enables applications such as explicit camera control and 3D-aware interactive editing, providing a geometrically grounded framework for scalable, memory-driven video generation.

顶级标签: video generation multi-modal computer vision

Spatia：基于可更新空间记忆的视频生成 / Spatia: Video Generation with Updatable Spatial Memory

1️⃣ 一句话总结

这篇论文提出了一种名为Spatia的新视频生成框架，它通过引入一个可动态更新的3D点云作为空间记忆，有效解决了现有模型在生成长视频时难以保持场景空间一致性的问题，并支持精确的相机控制和3D交互编辑。

👋 没兴趣 ☆ 感兴趣 📌 待读

打开原文 PDF

源自 arXiv: 2512.15716

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要