arXiv submission date: 2026-01-16
📄 Abstract - CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation

Character image animation is gaining significant importance across various domains, driven by the demand for robust and flexible multi-subject rendering. While existing methods excel in single-person animation, they struggle to handle arbitrary subject counts, diverse character types, and spatial misalignment between the reference image and the driving poses. We attribute these limitations to an overly rigid spatial binding that forces strict pixel-wise alignment between the pose and reference, and an inability to consistently rebind motion to intended subjects. To address these challenges, we propose CoDance, a novel Unbind-Rebind framework that enables the animation of arbitrary subject counts, types, and spatial configurations conditioned on a single, potentially misaligned pose sequence. Specifically, the Unbind module employs a novel pose shift encoder to break the rigid spatial binding between the pose and the reference by introducing stochastic perturbations to both poses and their latent features, thereby compelling the model to learn a location-agnostic motion representation. To ensure precise control and subject association, we then devise a Rebind module, leveraging semantic guidance from text prompts and spatial guidance from subject masks to direct the learned motion to intended characters. Furthermore, to facilitate comprehensive evaluation, we introduce a new multi-subject CoDanceBench. Extensive experiments on CoDanceBench and existing datasets show that CoDance achieves SOTA performance, exhibiting remarkable generalization across diverse subjects and spatial layouts. The code and weights will be open-sourced.
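The code and weights have not yet been released, and the abstract only states that the Unbind module applies "stochastic perturbations to both poses and their latent features." The sketch below is a minimal PyTorch illustration of that idea under the assumption that the perturbation is a random per-clip spatial shift of the rasterized pose maps plus Gaussian noise on the pose latents; the function name `unbind_perturb` and the parameters `max_shift` and `noise_std` are hypothetical, not from the paper.

```python
import torch

def unbind_perturb(pose_maps: torch.Tensor,
                   pose_latents: torch.Tensor,
                   max_shift: int = 32,
                   noise_std: float = 0.1):
    """Break pixel-wise pose/reference alignment (hypothetical sketch).

    pose_maps:    (B, T, C, H, W) rasterized driving-pose sequence
    pose_latents: (B, T, D) features from a pose encoder
    Returns spatially shifted pose maps and noised pose latents.
    """
    b = pose_maps.shape[0]
    # One shared shift per clip keeps the perturbed motion temporally coherent.
    dy = torch.randint(-max_shift, max_shift + 1, (b,))
    dx = torch.randint(-max_shift, max_shift + 1, (b,))
    shifted = torch.empty_like(pose_maps)
    for i in range(b):
        shifted[i] = torch.roll(
            pose_maps[i], shifts=(int(dy[i]), int(dx[i])), dims=(-2, -1)
        )
    # Perturb the latent features as well, per the Unbind description.
    noisy_latents = pose_latents + noise_std * torch.randn_like(pose_latents)
    return shifted, noisy_latents
```

Training against such perturbed inputs would force the model to learn a location-agnostic motion representation, which the Rebind module then routes to the intended subjects via text prompts and subject masks.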

Top-level tags: computer vision, multi-modal, AIGC
Detailed tags: character animation, image-to-video, motion transfer, multi-subject rendering, pose guidance

CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation


1️⃣ One-sentence summary

This paper proposes CoDance, a new method that "unbinds" motion from its tight coupling to spatial position and "rebinds" it to the intended subjects, addressing the difficulty existing techniques have in animating multiple characters of arbitrary number, type, or spatial misalignment within an image, and enabling more flexible and robust multi-subject animation.

Source: arXiv 2601.11096