超越已知物体:一种基于负感知范数的开放集目标检测新框架 / Beyond Known Objects: A Novel Framework for Open-Set Object Detection using Negative-Aware Norm
1️⃣ 一句话总结
本文提出一个名为NAN-SPOT的轻量级框架,无需重新训练已有的目标检测模型,仅通过几分钟的微调和几百张图像就能准确识别出从未见过的新物体,为自动驾驶等开放场景的感知任务提供了高效且实用的解决方案。
Open-Set Object Detection (OSOD) is crucial for autonomous driving, where perception systems must recognize and localize both known and previously unseen objects in complex, dynamic environments. While recent approaches deliver promising results, they often require retraining the detector extensively to learn objectness, which describes the likelihood that a bounding box tightly encloses a valid object, regardless of whether its category was learned during training. Deviating from existing work, we hypothesize that standard off-the-shelf detectors may already contain helpful cues for objectness, owing to their training on numerous and diverse known categories. Building on this idea, we propose NAN-SPOT, a training-light framework that does not require to retrain the base object detector and estimates objectness by leveraging a hidden layer metric called Negative-Aware Norm (NAN), requiring only minutes of training on just hundreds of images. To support comprehensive evaluation, we introduce COCO-Open, an expanded version of the existing COCO-Mixed dataset, increasing unknown object annotations from 433 to 1853, making it the most exhaustively labeled dataset for OSOD to the best of our knowledge. Experimental results demonstrate that NAN-SPOT achieves even better performance on unknown object detection than methods requiring heavy training, without compromising performance on known objects. This efficiency and robustness make NAN-SPOT a promising step towards open-world perception in autonomous driving.
超越已知物体:一种基于负感知范数的开放集目标检测新框架 / Beyond Known Objects: A Novel Framework for Open-Set Object Detection using Negative-Aware Norm
本文提出一个名为NAN-SPOT的轻量级框架,无需重新训练已有的目标检测模型,仅通过几分钟的微调和几百张图像就能准确识别出从未见过的新物体,为自动驾驶等开放场景的感知任务提供了高效且实用的解决方案。
源自 arXiv: 2605.02284