菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-16
📄 Abstract - Fast SAM 3D Body: Accelerating SAM 3D Body for Real-Time Full-Body Human Mesh Recovery

SAM 3D Body (3DB) achieves state-of-the-art accuracy in monocular 3D human mesh recovery, yet its inference latency of several seconds per image precludes real-time application. We present Fast SAM 3D Body, a training-free acceleration framework that reformulates the 3DB inference pathway to achieve interactive rates. By decoupling serial spatial dependencies and applying architecture-aware pruning, we enable parallelized multi-crop feature extraction and streamlined transformer decoding. Moreover, to extract the joint-level kinematics (SMPL) compatible with existing humanoid control and policy learning frameworks, we replace the iterative mesh fitting with a direct feedforward mapping, accelerating this specific conversion by over 10,000x. Overall, our framework delivers up to a 10.9x end-to-end speedup while maintaining on-par reconstruction fidelity, even surpassing 3DB on benchmarks such as LSPET. We demonstrate its utility by deploying Fast SAM 3D Body in a vision-only teleoperation system that-unlike methods reliant on wearable IMUs-enables real-time humanoid control and the direct collection of manipulation policies from a single RGB stream.

顶级标签: computer vision model training systems
详细标签: human mesh recovery real-time inference model acceleration 3d reconstruction teleoperation 或 搜索:

Fast SAM 3D Body:加速SAM 3D Body以实现实时全身人体网格重建 / Fast SAM 3D Body: Accelerating SAM 3D Body for Real-Time Full-Body Human Mesh Recovery


1️⃣ 一句话总结

这篇论文提出了一种无需重新训练的加速框架,通过并行化特征提取和简化模型结构,将原本耗时的3D人体重建模型提速超过10倍,使其能够实时运行,并成功应用于仅需普通摄像头的机器人远程操控系统。

源自 arXiv: 2603.15603