菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-24
📄 Abstract - Boosting Instance Awareness via Cross-View Correlation with 4D Radar and Camera for 3D Object Detection

4D millimeter-wave radar has emerged as a promising sensing modality for autonomous driving due to its robustness and affordability. However, its sparse and weak geometric cues make reliable instance activation difficult, limiting the effectiveness of existing radar-camera fusion paradigms. BEV-level fusion offers global scene understanding but suffers from weak instance focus, while perspective-level fusion captures instance details but lacks holistic context. To address these limitations, we propose SIFormer, a scene-instance aware transformer for 3D object detection using 4D radar and camera. SIFormer first suppresses background noise during view transformation through segmentation- and depth-guided localization. It then introduces a cross-view activation mechanism that injects 2D instance cues into BEV space, enabling reliable instance awareness under weak radar geometry. Finally, a transformer-based fusion module aggregates complementary image semantics and radar geometry for robust perception. As a result, with the aim of enhancing instance awareness, SIFormer bridges the gap between the two paradigms, combining their complementary strengths to address inherent sparse nature of radar and improve detection accuracy. Experiments demonstrate that SIFormer achieves state-of-the-art performance on View-of-Delft, TJ4DRadSet and NuScenes datasets. Source code is available at this http URL.

顶级标签: computer vision robotics systems
详细标签: 3d object detection sensor fusion autonomous driving 4d radar transformer 或 搜索:

通过4D雷达与相机跨视图关联提升实例感知能力,用于3D目标检测 / Boosting Instance Awareness via Cross-View Correlation with 4D Radar and Camera for 3D Object Detection


1️⃣ 一句话总结

这篇论文提出了一种名为SIFormer的新方法,它通过巧妙结合4D雷达和相机数据,并利用跨视图激活机制,有效解决了雷达数据稀疏导致的实例感知难题,从而在自动驾驶的3D目标检测任务中取得了领先的性能。

源自 arXiv: 2602.20632