JointHOI:联合生成接触图增强手物交互生成 / JointHOI: Jointly Generating Contact Maps Enhances Hand Object Interaction Generation
1️⃣ 一句话总结
本文提出一种名为JointHOI的单阶段扩散模型,能够根据文本描述同时生成手与物体的三维运动以及动态的接触地图,通过将接触作为辅助信息联合学习,有效解决了以往方法中常见的手物漂浮和穿透等物理不合理问题,显著提升了交互动作的真实感和稳定性。
Text driven hand object interaction (HOI) generation is gaining attention for immersive applications and robotics, yet producing physically plausible interactions remains challenging. Even when individual motions appear natural, small contact errors can cause conspicuous artifacts such as floating and interpenetration. Prior methods mitigate these issues using explicit contact cues or implicit grasp priors, but typically rely on multi stage pipelines and fail to model temporally evolving contact. We present JointHOI, a single stage diffusion framework that jointly generates 3D hand object motion and dynamic, distance based contact maps from text. By treating contact as an auxiliary inner modality, joint generation enables the model to learn contact motion coupling during training. At inference, contact guided sampling enforces consistency between generated contact maps and motion implied geometry, improving temporal stability and reducing penetration and floating. Experiments on GRAB and ARCTIC demonstrate consistent improvements in text adherence and physical plausibility over prior methods.
JointHOI:联合生成接触图增强手物交互生成 / JointHOI: Jointly Generating Contact Maps Enhances Hand Object Interaction Generation
本文提出一种名为JointHOI的单阶段扩散模型,能够根据文本描述同时生成手与物体的三维运动以及动态的接触地图,通过将接触作为辅助信息联合学习,有效解决了以往方法中常见的手物漂浮和穿透等物理不合理问题,显著提升了交互动作的真实感和稳定性。
源自 arXiv: 2607.01768