菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-16
📄 Abstract - Enhancing Hands in 3D Whole-Body Pose Estimation with Conditional Hands Modulator

Accurately recovering hand poses within the body context remains a major challenge in 3D whole-body pose estimation. This difficulty arises from a fundamental supervision gap: whole-body pose estimators are trained on full-body datasets with limited hand diversity, while hand-only estimators, trained on hand-centric datasets, excel at detailed finger articulation but lack global body awareness. To address this, we propose Hand4Whole++, a modular framework that leverages the strengths of both pre-trained whole-body and hand pose estimators. We introduce CHAM (Conditional Hands Modulator), a lightweight module that modulates the whole-body feature stream using hand-specific features extracted from a pre-trained hand pose estimator. This modulation enables the whole-body model to predict wrist orientations that are both accurate and coherent with the upper-body kinematic structure, without retraining the full-body model. In parallel, we directly incorporate finger articulations and hand shapes predicted by the hand pose estimator, aligning them to the full-body mesh via differentiable rigid alignment. This design allows Hand4Whole++ to combine globally consistent body reasoning with fine-grained hand detail. Extensive experiments demonstrate that Hand4Whole++ substantially improves hand accuracy and enhances overall full-body pose quality.

顶级标签: computer vision model training systems
详细标签: 3d pose estimation whole-body modeling hand pose modular framework feature modulation 或 搜索:

使用条件手部调制器增强3D全身姿态估计中的手部姿态 / Enhancing Hands in 3D Whole-Body Pose Estimation with Conditional Hands Modulator


1️⃣ 一句话总结

这篇论文提出了一个名为Hand4Whole++的新框架,它通过一个轻量级的条件手部调制器模块,巧妙地将擅长全局身体姿态的模型与擅长精细手部姿态的模型结合起来,从而在3D全身姿态估计中同时实现了准确的手腕定位和精细的手指动作捕捉。

源自 arXiv: 2603.14726