菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-27
📄 Abstract - SAM-Enhanced Segmentation on Road Datasets: Balancing Critical Classes in Autonomous Driving

Dense semantic segmentation is essential for autonomous driving, yet many multi-modal datasets lack pixel-level annotations. The Zenseact Open Dataset (ZOD) provides rich multi-sensor data but only bounding-box labels, limiting its use for segmentation research. Our primary contribution is a Segment Anything Model (SAM)-based annotation pipeline that produces dense, pixel-level annotations for ZOD by converting bounding boxes into semantic masks. In this pilot study, we process over 100,000 frames and manually curate a 2,300-frame subset (36% acceptance rate) to establish a reliable baseline. Using these annotations, we evaluate transformer-based CLFT and CNN-based DeepLabV3+ architectures across diverse weather conditions, achieving up to 48.1% mIoU with CLFT-Hybrid. To address extreme class imbalance, where pedestrians, cyclists, and signs constitute less than 1% of pixels, we explore specialized models targeting rare classes. We further validate the pipeline on the Iseauto autonomous-vehicle platform, achieving 77.5% mIoU, and show that SAM-derived representations transfer effectively across sensor configurations via bidirectional transfer learning. All code and annotations are released to support reproducible research.

顶级标签: computer vision autonomous driving machine learning
详细标签: semantic segmentation segment anything model annotation pipeline class imbalance zenseact open dataset 或 搜索:

基于SAM增强的道路数据集分割:平衡自动驾驶中的关键类别 / SAM-Enhanced Segmentation on Road Datasets: Balancing Critical Classes in Autonomous Driving


1️⃣ 一句话总结

本文提出了一种使用SAM模型将自动驾驶数据集的边界框自动转换为像素级分割标注的方法,解决了多模态数据缺乏精细标注的问题,并通过优化模型在极端类别不平衡下仍然能准确识别行人、自行车等罕见但关键目标。

源自 arXiv: 2605.28136