QYOLO: Lightweight Object Detection via Quantum Inspired Shared Channel Mixing

📄 Abstract - QYOLO: Lightweight Object Detection via Quantum Inspired Shared Channel Mixing

The rapid advancement of object detection architectures has positioned single-stage detectors as the dominant solution for real-time visual perception. A primary source of computational overhead in these models lies in the deep backbone stages, where C2f bottleneck modules at high stride levels accumulate a disproportionate share of parameters because parameter count scales quadratically with channel width. This work introduces QYOLO, a quantum-inspired channel mixing framework that achieves genuine architectural compression by replacing the two deepest backbone C2f modules, at P4/16 (512 channels) and P5/32 (1024 channels), with a compact QMixBlock. The proposed block performs global channel recalibration through a sinusoidal mixing mechanism whose learnable parameters are shared across both backbone stages, enforcing consistent channel importance without requiring independent per-stage parameter sets. The neck and detection head remain fully classical and unchanged. Evaluation on the VisDrone2019 benchmark shows that QYOLOv8n achieves a 20.2% reduction in parameter count (3.01M to 2.40M) and a 12.3% reduction in GFLOPs at a cost of only 0.4 pp in mAP@50. QYOLOv8s achieves a 21.8% reduction with 0.1 pp degradation. When combined with knowledge distillation, full accuracy parity is recovered at no cost to compression. An expanded backbone-plus-neck variant achieved a 38 to 41% reduction at the cost of greater accuracy degradation, motivating the backbone-only final design.
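The quadratic-scaling argument in the abstract can be illustrated with a small parameter-count sketch. The helpers `conv_params` and `relative_reduction` are hypothetical names introduced here, and the same-width 3x3 convolution is an assumed stand-in for a single layer, not a count of the actual C2f modules in any YOLOv8 variant.

```python
def conv_params(c_in: int, c_out: int, k: int = 3) -> int:
    """Weight count of a bias-free k x k convolution (illustrative helper)."""
    return k * k * c_in * c_out


def relative_reduction(before: float, after: float) -> float:
    """Fractional reduction when going from `before` to `after`."""
    return (before - after) / before


# Assumed same-width 3x3 convolutions at the two channel widths
# named in the abstract (512 at P4/16, 1024 at P5/32).
p4 = conv_params(512, 512)
p5 = conv_params(1024, 1024)

# Doubling the channel width quadruples the weight count.
ratio = p5 / p4

# Reduction computed from the rounded parameter counts quoted in the abstract.
reduction = relative_reduction(3.01e6, 2.40e6)

print(f"P4 conv weights: {p4:,}")
print(f"P5 conv weights: {p5:,}")
print(f"P5 / P4 ratio:   {ratio:.1f}")
print(f"Parameter reduction from rounded counts: {reduction:.1%}")
```

Because the quoted counts (3.01M and 2.40M) are themselves rounded, the recomputed reduction can differ from the reported 20.2% in the last digit.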

1️⃣ One-Sentence Summary

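This paper proposes QYOLO, a lightweight object detection method that replaces the two deepest C2f modules in the conventional YOLO backbone with QMixBlock, a new quantum-inspired module. With the detection head and neck left unchanged, it substantially reduces the parameter count and computational cost, and its efficiency is validated on the VisDrone2019 dataset: QYOLOv8n, for example, has 20.2% fewer parameters and 12.3% less computation with only a slight drop in accuracy, and the original accuracy can be recovered when knowledge distillation is added.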
