菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-05
📄 Abstract - A Comparative Study of 3D Person Detection: Sensor Modalities and Robustness in Diverse Indoor and Outdoor Environments

Accurate 3D person detection is critical for safety in applications such as robotics, industrial monitoring, and surveillance. This work presents a systematic evaluation of 3D person detection using camera-only, LiDAR-only, and camera-LiDAR fusion. While most existing research focuses on autonomous driving, we explore detection performance and robustness in diverse indoor and outdoor scenes using the JRDB dataset. We compare three representative models - BEVDepth (camera), PointPillars (LiDAR), and DAL (camera-LiDAR fusion) - and analyze their behavior under varying occlusion and distance levels. Our results show that the fusion-based approach consistently outperforms single-modality models, particularly in challenging scenarios. We further investigate robustness against sensor corruptions and misalignments, revealing that while DAL offers improved resilience, it remains sensitive to sensor misalignment and certain LiDAR-based corruptions. In contrast, the camera-based BEVDepth model showed the lowest performance and was most affected by occlusion, distance, and noise. Our findings highlight the importance of utilizing sensor fusion for enhanced 3D person detection, while also underscoring the need for ongoing research to address the vulnerabilities inherent in these systems.

顶级标签: computer vision robotics systems
详细标签: 3d person detection sensor fusion lidar camera robustness evaluation 或 搜索:

三维行人检测对比研究:不同室内外环境下的传感器模态与鲁棒性 / A Comparative Study of 3D Person Detection: Sensor Modalities and Robustness in Diverse Indoor and Outdoor Environments


1️⃣ 一句话总结

这篇论文通过系统对比摄像头、激光雷达以及两者融合的方案,发现在复杂室内外场景下,融合传感器的方法能更稳定地检测三维行人,但其自身仍存在对传感器错位等问题的脆弱性,强调了融合技术的优势与持续改进的必要性。

源自 arXiv: 2602.05538