FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction

📄 Abstract - FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction

Current diffusion-based acceleration methods for long-portrait animation struggle to ensure identity (ID) consistency. This paper presents FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6x acceleration in inference speed. In particular, FlashPortrait begins by computing the identity-agnostic facial expression features with an off-the-shelf extractor. It then introduces a Normalized Facial Expression Block to align facial features with diffusion latents by normalizing them with their respective means and variances, thereby improving identity stability in facial modeling. During inference, FlashPortrait adopts a dynamic sliding-window scheme with weighted blending in overlapping areas, ensuring smooth transitions and ID consistency in long animations. In each context window, based on the latent variation rate at particular timesteps and the derivative magnitude ratio among diffusion layers, FlashPortrait utilizes higher-order latent derivatives at the current timestep to directly predict latents at future timesteps, thereby skipping several denoising steps and achieving 6x speed acceleration. Experiments on benchmarks show the effectiveness of FlashPortrait both qualitatively and quantitatively.

FlashPortrait：通过自适应潜在预测实现6倍速的无限肖像动画 / FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction

1️⃣ 一句话总结

这篇论文提出了一种名为FlashPortrait的新方法，它通过自适应预测未来步骤的潜在特征，在生成无限长肖像视频时能保持人物身份一致，同时将推理速度提升了6倍。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要