StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

📄 Abstract - StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

The rapid growth of stereoscopic displays, including VR headsets and 3D cinemas, has led to increasing demand for high-quality stereo video content. However, producing 3D videos remains costly and complex, while automatic Monocular-to-Stereo conversion is hindered by the limitations of the multi-stage ``Depth-Warp-Inpaint'' (DWI) pipeline. This paradigm suffers from error propagation, depth ambiguity, and format inconsistency between parallel and converged stereo configurations. To address these challenges, we introduce UniStereo, the first large-scale unified dataset for stereo video conversion, covering both stereo formats to enable fair benchmarking and robust model training. Building upon this dataset, we propose StereoPilot, an efficient feed-forward model that directly synthesizes the target view without relying on explicit depth maps or iterative diffusion sampling. Equipped with a learnable domain switcher and a cycle consistency loss, StereoPilot adapts seamlessly to different stereo formats and achieves improved consistency. Extensive experiments demonstrate that StereoPilot significantly outperforms state-of-the-art methods in both visual fidelity and computational efficiency. Project page: this https URL.

StereoPilot：通过生成先验学习统一且高效的立体视频转换 / StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

1️⃣ 一句话总结

这篇论文提出了一个名为StereoPilot的高效模型和一个大规模统一数据集UniStereo，能够直接、高质量地将普通2D视频转换为适用于VR和3D影院的不同格式的立体视频，解决了传统方法流程复杂、效果差的问题。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要