菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-07
📄 Abstract - Beyond the Beep: Scalable Collision Anticipation and Real-Time Explainability with BADAS-2.0

We present BADAS-2.0, the second generation of our collision anticipation system, building on BADAS-1.0 [7], which showed that fine-tuning V-JEPA2 [1] on large-scale ego-centric dashcam data outperforms both academic baselines and production ADAS systems. BADAS-2.0 advances the state of the art along three axes. (i) Long-tail benchmark and accuracy: We introduce a 10-group long-tail benchmark targeting rare and safety-critical scenarios. To construct it, BADAS-1.0 is used as an active oracle to score millions of unlabeled drives and surface high-risk candidates for annotation. Combined with Nexar's Atlas platform [13] for targeted data collection, this expands the dataset from 40k to 178,500 labeled videos (~2M clips), yielding consistent gains across all subgroups, with the largest improvements on the hardest long-tail cases. (ii) Knowledge distillation to edge: Domain-specific self-supervised pre-training on 2.25M unlabeled driving videos enables distillation into compact models, BADAS-2.0-Flash (86M) and BADAS-2.0-Flash-Lite (22M), achieving 7-12x speedup with near-parity accuracy, enabling real-time edge deployment. (iii) Explainability: BADAS-2.0 produces real-time object-centric attention heatmaps that localize the evidence behind predictions. BADAS-Reason [17] extends this with a vision-language model that consumes the last frame and heatmap to generate driver actions and structured textual reasoning. Inference code and evaluation benchmarks are publicly available.

顶级标签: computer vision systems model training
详细标签: collision anticipation knowledge distillation explainable ai edge deployment driving dataset 或 搜索:

超越警报:BADAS-2.0——可扩展的碰撞预测与实时可解释性系统 / Beyond the Beep: Scalable Collision Anticipation and Real-Time Explainability with BADAS-2.0


1️⃣ 一句话总结

本文介绍了第二代碰撞预测系统BADAS-2.0,它通过构建一个包含大量罕见危险场景的数据集、将大模型压缩成可在车载设备上实时运行的小模型,并生成可视化热图和文字解释,显著提升了自动驾驶系统预测碰撞的准确性、速度和可理解性。

源自 arXiv: 2604.05767