菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-27
📄 Abstract - Unconstrained Multi-view Human Pose Estimation with Algebraic Priors

Recovering 3D human pose from multi-view imagery typically relies on precise camera calibration, which is often unavailable in real-world scenarios, thereby severely limiting the applicability of existing methods. To overcome this challenge, we propose an unconstrained framework that synergizes deep neural networks, algebraic priors, and temporal dynamics for uncalibrated multi-view human pose estimation. First, we introduce the Triangulation with Transformer Regressor (TTR), which reformulates classical triangulation into a data-driven token fusion process to bypass the dependency on explicit camera parameters. Second, to explicitly embed the inherent algebraic relations of the multi-view variety into the learning process, we propose the Gröbner basis Corrector (GC). This pioneering loss formulation enforces constraints derived from the multi-view variety to ensure the neural predictions strictly adhere to the laws of projective geometry. Finally, we devise the Temporal Equivariant Rectifier (TER), which exploits the equivariance property of human motion to impose temporal coherence and structural consistency, effectively mitigating scale ambiguity in uncalibrated settings. Extensive evaluations on standard benchmarks demonstrate that our framework establishes a new state-of-the-art for uncalibrated multi-view human pose estimation. Notably, our approach significantly closes the performance gap between calibration-free methods and fully calibrated oracles.

顶级标签: computer vision machine learning
详细标签: multi-view pose estimation algebraic priors uncalibrated cameras temporal coherence gröbner basis 或 搜索:

基于代数先验的无约束多视角人体姿态估计 / Unconstrained Multi-view Human Pose Estimation with Algebraic Priors


1️⃣ 一句话总结

本文提出了一种无需相机标定的多视角3D人体姿态估计方法,通过结合深度学习、代数几何约束和时间信息,在保持高精度的同时大幅缩小了与传统需要精确标定的方法之间的性能差距。

源自 arXiv: 2604.24312