增强YOLOv11n模型以在嘈杂监控视频中实现可靠的儿童检测 / Enhancing YOLOv11n for Reliable Child Detection in Noisy Surveillance Footage
1️⃣ 一句话总结
本研究提出了一种轻量且实用的方法,通过改进数据增强和推理策略,有效提升了YOLOv11n模型在低质量、复杂场景(如遮挡、模糊、光线差)下检测儿童的准确率,使其更适合在资源有限的边缘设备上实时部署,用于儿童走失预警或托育监控等实际应用。
This paper presents a practical and lightweight solution for enhancing child detection in low-quality surveillance footage, a critical component in real-world missing child alert and daycare monitoring systems. Building upon the efficient YOLOv11n architecture, we propose a deployment-ready pipeline that improves detection under challenging conditions including occlusion, small object size, low resolution, motion blur, and poor lighting commonly found in existing CCTV infrastructures. Our approach introduces a domain-specific augmentation strategy that synthesizes realistic child placements using spatial perturbations such as partial visibility, truncation, and overlaps, combined with photometric degradations including lighting variation and noise. To improve recall of small and partially occluded instances, we integrate Slicing Aided Hyper Inference (SAHI) at inference time. All components are trained and evaluated on a filtered, child-only subset of the Roboflow Daycare dataset. Compared to the baseline YOLOv11n, our enhanced system achieves a mean Average Precision at 0.5 IoU (mAP@0.5) of 0.967 and a mean Average Precision averaged over IoU thresholds from 0.5 to 0.95 (mAP@0.5:0.95) of 0.783, yielding absolute improvements of 0.7 percent and 2.3 percent, respectively, without architectural changes. Importantly, the entire pipeline maintains compatibility with low-power edge devices and supports real-time performance, making it particularly well suited for low-cost or resource-constrained industrial surveillance deployments. The example augmented dataset and the source code used to generate it are available at: this https URL
增强YOLOv11n模型以在嘈杂监控视频中实现可靠的儿童检测 / Enhancing YOLOv11n for Reliable Child Detection in Noisy Surveillance Footage
本研究提出了一种轻量且实用的方法,通过改进数据增强和推理策略,有效提升了YOLOv11n模型在低质量、复杂场景(如遮挡、模糊、光线差)下检测儿童的准确率,使其更适合在资源有限的边缘设备上实时部署,用于儿童走失预警或托育监控等实际应用。
源自 arXiv: 2602.10592