📄
Abstract - Comparative Study of Vision-Based Metric Measurement for Large-Scale Planar Scenes
Vision-based metric distance and area measurement remains challenging in large-scale outdoor environments due to long-range sensing, camera zoom, and unstable imaging conditions. This work studies planar metric measurement in a real-world reservoir monitoring scenario using PTZ cameras and compares three representative approaches: geometry-based monocular ranging, image stitching with birds-eye-view transformation, and stereo-based ranging using two jointly calibrated monocular cameras. For monocular ranging, planar localization models are derived from camera geometry and the effect of camera pitch angle is analyzed. Image stitching is investigated for large-area mapping, while a stereo-based scheme is developed for long-range measurement without dedicated stereo hardware. Experiments show clear trade-offs: monocular ranging achieves meter-level accuracy under sufficiently large pitch angles, stereo-based ranging achieves decimeter-level accuracy with reduced sensitivity to pitch variations, and image stitching is effective for small-scale scenes but degrades in stability and scalability as scene size increases.
基于视觉的大尺度平面场景度量测量方法比较研究 /
Comparative Study of Vision-Based Metric Measurement for Large-Scale Planar Scenes
1️⃣ 一句话总结
本文针对大型户外平面场景(如水库监测)中的距离和面积测量难题,对比研究了三种基于视觉的方法——单目几何测距、鸟瞰图拼接测距和双目光学测距,发现单目法在大俯仰角下可达米级精度,双目法能实现分米级精度且受俯仰角影响小,而拼接法仅适用于小范围场景,规模增大时稳定性显著下降。