菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-20
📄 Abstract - Latent Dynamics for Full Body Avatar Animation

Pose-driven full-body avatars built on neural rendering produce high-quality novel views of a captured subject. Yet loose clothing and other dynamic elements deform in ways pose alone cannot explain: the same pose can correspond to many different states, because their motion depends on history, inertia, and contact. Explicit simulation and layered-garment methods can model such dynamics, but they require either a dedicated garment template, which raw multi-view capture does not naturally provide, or a test-time physics simulator with non-trivial runtime cost. A parallel line of work learns data-driven clothing avatars that avoid explicit garment layers. These methods add an auxiliary latent for variation beyond pose; at inference, they fix it, regress it from pose, or retrieve it from training data, without explicitly modeling how the latent evolves with its own dynamics. Additionally, even in everyday motion with loose clothing, existing architectures often struggle to capture fine-grained detail, producing blurry renderings and temporal artifacts. We augment a pose-conditioned 3D Gaussian avatar with a transformer-based decoder and a dynamics residual latent that captures temporal appearance and geometry variation beyond the driving signals. At inference, a learned latent dynamics model evolves the residual latent from a short pose history and the previous latent state. The model decomposes each update into driving, restoring, and dissipative forces, producing temporally coherent, history-dependent rollouts with negligible added cost. Different initial conditions yield diverse yet plausible motion trajectories, and the force decomposition exposes controls such as stiffness. Across nine captured sequences of everyday motion with diverse loose garments, quantitative metrics and a perceptual user study show improved animation quality over recent data-driven baselines.

顶级标签: computer vision machine learning video
详细标签: avatar animation neural rendering latent dynamics temporal coherence garment modeling 或 搜索:

基于潜在动力学的全身角色动画生成 / Latent Dynamics for Full Body Avatar Animation


1️⃣ 一句话总结

该论文提出了一种结合学习型潜在动力学模型和3D高斯渲染的全身角色动画方法,能够在不依赖物理模拟或预先制作服饰模板的情况下,利用历史动作信息预测并生成衣物等动态元素的自然、连贯变形,从而显著提升动画的真实感和细节质量。

源自 arXiv: 2605.21478