菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-25
📄 Abstract - SAM3-Assisted Training of Lightweight YOLO Models for Precision Pig Farming

Deep learning-based object detection has revolutionized Precision Livestock Farming (PLF), yet a critical barrier remains: high-performance Foundation Models (such as SAM 3) are too computationally intensive for edge deployment, while lightweight models (like YOLO) require prohibitive manual annotation efforts. This work proposes a fully automated knowledge distillation pipeline that leverages the Segment Anything Model 3 (SAM 3) to generate zero-shot pseudo-labels for training efficient YOLOv8 detectors. By treating SAM 3 as an offline auto-annotator, we eliminate the manual labeling bottleneck, producing models capable of real-time inference on resource-constrained hardware. We systematically evaluate this approach on the PigLife dataset, comparing SAM 3-supervised models against human-annotated baselines. Results demonstrate that a SAM 3-trained YOLOv8m achieves a mean Average Precision (mAP) of 79.4% without human intervention, while reducing inference latency by approximately 200$\times$ compared to the teacher model. Furthermore, stratified analysis reveals that in low-occlusion scenarios, the automated pipeline achieves detection rates comparable to human benchmarks ($AP_{50} > 99\%$). These findings indicate that foundation models can serve as effective, zero-annotation-cost supervisors, enabling scalable edge computing solutions for smart agriculture.

顶级标签: computer vision agriculture multi-modal
详细标签: object detection yolo sam3 knowledge distillation edge computing 或 搜索:

基于SAM3辅助训练的轻量级YOLO模型用于精准养猪业 / SAM3-Assisted Training of Lightweight YOLO Models for Precision Pig Farming


1️⃣ 一句话总结

本文提出一种自动化知识蒸馏方法,利用大型基础模型SAM3自动生成标注数据,无需人工标注即可训练轻量级YOLOv8检测模型,在精准养猪场景下实现接近人工标注的性能,同时推理速度提升约200倍,为资源受限的边缘设备部署提供了可行方案。

源自 arXiv: 2605.25860