菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-19
📄 Abstract - 3D Scene Rendering with Multimodal Gaussian Splatting

3D scene reconstruction and rendering are core tasks in computer vision, with applications spanning industrial monitoring, robotics, and autonomous driving. Recent advances in 3D Gaussian Splatting (GS) and its variants have achieved impressive rendering fidelity while maintaining high computational and memory efficiency. However, conventional vision-based GS pipelines typically rely on a sufficient number of camera views to initialize the Gaussian primitives and train their parameters, typically incurring additional processing cost during initialization while falling short in conditions where visual cues are unreliable, such as adverse weather, low illumination, or partial occlusions. To cope with these challenges, and motivated by the robustness of radio-frequency (RF) signals to weather, lighting, and occlusions, we introduce a multimodal framework that integrates RF sensing, such as automotive radar, with GS-based rendering as a more efficient and robust alternative to vision-only GS rendering. The proposed approach enables efficient depth prediction from only sparse RF-based depth measurements, yielding a high-quality 3D point cloud for initializing Gaussian functions across diverse GS architectures. Numerical tests demonstrate the merits of judiciously incorporating RF sensing into GS pipelines, achieving high-fidelity 3D scene rendering driven by RF-informed structural accuracy.

顶级标签: computer vision multi-modal model training
详细标签: 3d gaussian splatting radar sensing scene reconstruction depth prediction point cloud 或 搜索:

基于多模态高斯泼溅的三维场景渲染 / 3D Scene Rendering with Multimodal Gaussian Splatting


1️⃣ 一句话总结

这项研究提出了一种结合射频传感(如汽车雷达)与高斯泼溅技术的新方法,能够在视觉线索不佳的恶劣环境下,更高效、更鲁棒地完成高质量三维场景重建与渲染。

源自 arXiv: 2602.17124