DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces

📄 Abstract - DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces

Articulated object pose estimation is a core task in embodied AI. Existing methods typically regress poses in a continuous space, but often struggle with 1) navigating a large, complex search space and 2) failing to incorporate intrinsic kinematic constraints. In this work, we introduce DICArt (DIsCrete Diffusion for Articulation Pose Estimation), a novel framework that formulates pose estimation as a conditional discrete diffusion process. Instead of operating in a continuous domain, DICArt progressively denoises a noisy pose representation through a learned reverse diffusion procedure to recover the GT pose. To improve modeling fidelity, we propose a flexible flow decider that dynamically determines whether each token should be denoised or reset, effectively balancing the real and noise distributions during diffusion. Additionally, we incorporate a hierarchical kinematic coupling strategy, estimating the pose of each rigid part hierarchically to respect the object's kinematic structure. We validate DICArt on both synthetic and real-world datasets. Experimental results demonstrate its superior performance and robustness. By integrating discrete generative modeling with structural priors, DICArt offers a new paradigm for reliable category-level 6D pose estimation in complex environments.

DICArt：在离散状态空间中推进类别级铰接物体姿态估计 / DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces

1️⃣ 一句话总结

这篇论文提出了一种名为DICArt的新方法，它将复杂的铰接物体（如关节可动的机器人手臂或家具）的姿态估计问题，转化为一个在离散空间中进行‘去噪’的生成过程，并通过动态决策和分层结构约束，显著提升了估计的准确性和鲁棒性。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要