arXiv submission date: 2025-12-12
📄 Abstract - Flowception: Temporally Expansive Flow Matching for Video Generation

We present Flowception, a novel non-autoregressive, variable-length video generation framework. Flowception learns a probability path that interleaves discrete frame insertions with continuous frame denoising. Compared to autoregressive methods, Flowception alleviates error accumulation and drift, as the frame insertion mechanism used during sampling serves as an efficient compression mechanism for handling long-term context. Compared to full-sequence flows, our method reduces training FLOPs three-fold, is more amenable to local attention variants, and allows video length to be learned jointly with content. Quantitative experiments show improved FVD and VBench metrics over autoregressive and full-sequence baselines, further validated by qualitative results. Finally, by learning to insert and denoise frames in a sequence, Flowception seamlessly integrates tasks such as image-to-video generation and video interpolation.
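The abstract describes sampling as an interleaving of two kinds of steps: discrete insertions of new (noisy) frames into the sequence, and continuous flow-matching denoising of all frames currently present. The toy sketch below illustrates that control flow only; the velocity function, insertion policy, and scalar "frames" are stand-ins invented for illustration, not the paper's actual model.

```python
import random


def dummy_velocity(frames, t):
    # Hypothetical stand-in for a learned velocity field v_theta:
    # here it simply pushes each frame value toward 0 (the "clean" target).
    return [-f for f in frames]


def interleaved_sampling(num_insertions=4, denoise_steps=5, seed=0):
    """Toy sketch of interleaved sampling: alternate a discrete frame
    insertion with a few continuous Euler denoising steps over the
    whole current sequence (assumed structure, not the paper's code)."""
    rng = random.Random(seed)
    frames = []  # each scalar stands in for one latent video frame
    dt = 1.0 / denoise_steps
    for _ in range(num_insertions):
        # Discrete step: insert a fresh noise frame at some position.
        pos = rng.randint(0, len(frames))
        frames.insert(pos, rng.gauss(0.0, 1.0))
        # Continuous step: Euler integration of the velocity field,
        # applied jointly to every frame in the sequence so far.
        t = 0.0
        for _ in range(denoise_steps):
            v = dummy_velocity(frames, t)
            frames = [f + dt * vf for f, vf in zip(frames, v)]
            t += dt
    return frames


video = interleaved_sampling()
print(len(video))  # number of frames equals the number of insertions
```

Note that earlier-inserted frames receive more denoising passes than later ones, which is one way a variable-length sequence can be grown and refined at the same time.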

Top-level tags: video generation, model training, aigc
Detailed tags: flow matching, non-autoregressive, video interpolation, temporal modeling, variable-length generation

Flowception: Temporally Expansive Flow Matching for Video Generation


1️⃣ One-sentence summary

This paper proposes Flowception, a new video generation method that alternates between inserting new frames and refining existing ones to efficiently produce high-quality, variable-length videos. Compared with conventional approaches, it reduces computational cost and error accumulation, and it handles tasks such as image-to-video generation and video interpolation within a single framework.


Source: arXiv 2512.11438