SAM-Enhanced Segmentation on Road Datasets: Balancing Critical Classes in Autonomous Driving

📄 Abstract - SAM-Enhanced Segmentation on Road Datasets: Balancing Critical Classes in Autonomous Driving

Dense semantic segmentation is essential for autonomous driving, yet many multi-modal datasets lack pixel-level annotations. The Zenseact Open Dataset (ZOD) provides rich multi-sensor data but only bounding-box labels, limiting its use for segmentation research. Our primary contribution is a Segment Anything Model (SAM)-based annotation pipeline that produces dense, pixel-level annotations for ZOD by converting bounding boxes into semantic masks. In this pilot study, we process over 100,000 frames and manually curate a 2,300-frame subset (36% acceptance rate) to establish a reliable baseline. Using these annotations, we evaluate transformer-based CLFT and CNN-based DeepLabV3+ architectures across diverse weather conditions, achieving up to 48.1% mIoU with CLFT-Hybrid. To address extreme class imbalance, where pedestrians, cyclists, and signs constitute less than 1% of pixels, we explore specialized models targeting rare classes. We further validate the pipeline on the Iseauto autonomous-vehicle platform, achieving 77.5% mIoU, and show that SAM-derived representations transfer effectively across sensor configurations via bidirectional transfer learning. All code and annotations are released to support reproducible research.

基于SAM增强的道路数据集分割：平衡自动驾驶中的关键类别 / SAM-Enhanced Segmentation on Road Datasets: Balancing Critical Classes in Autonomous Driving

1️⃣ 一句话总结

本文提出了一种使用SAM模型将自动驾驶数据集的边界框自动转换为像素级分割标注的方法，解决了多模态数据缺乏精细标注的问题，并通过优化模型在极端类别不平衡下仍然能准确识别行人、自行车等罕见但关键目标。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要