arXiv submission date: 2025-12-09
📄 Abstract - Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

Neural rendering, particularly 3D Gaussian Splatting (3DGS), has evolved rapidly and become a key component for building world models. However, existing viewer solutions remain fragmented, heavy, or constrained by legacy pipelines, resulting in high deployment friction and limited support for dynamic content and generative models. In this work, we present Visionary, an open, web-native platform for real-time rendering of various Gaussian Splatting variants and meshes. Built on an efficient WebGPU renderer with per-frame ONNX inference, Visionary enables dynamic neural processing while maintaining a lightweight, "click-to-run" browser experience. It introduces a standardized Gaussian Generator contract, which not only supports standard 3DGS rendering but also allows plug-and-play algorithms to generate or update Gaussians each frame. This per-frame inference also makes it possible to apply feedforward generative post-processing. The platform further offers a plug-in this http URL library with a concise TypeScript API for seamless integration into existing web applications. Experiments show that, on identical 3DGS assets, Visionary achieves higher rendering efficiency than current web viewers thanks to GPU-based primitive sorting. It already supports multiple variants, including MLP-based 3DGS, 4DGS, neural avatars, and style transformation or enhancement networks. By unifying inference and rendering directly in the browser, Visionary significantly lowers the barrier to reproduction, comparison, and deployment of 3DGS-family methods, serving as a unified World Model Carrier for both reconstructive and generative paradigms.
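
The abstract's idea of a generator contract that produces or updates Gaussians every frame, followed by primitive sorting, can be illustrated with a small TypeScript sketch. Everything named here (`GaussianFrame`, `GaussianGenerator`, `staticGenerator`, `driftGenerator`, `sortBackToFront`) is hypothetical and is not taken from Visionary's API; the sort is a plain CPU reference standing in for the GPU-based sorting the paper describes, and it assumes a camera at the origin looking along `viewDir`.

```typescript
// Hypothetical names throughout; this is not Visionary's actual API.

interface GaussianFrame {
  positions: Float32Array; // flat xyz, three floats per Gaussian
  opacities: Float32Array; // one value per Gaussian
}

interface GaussianGenerator {
  // Called once per frame; returns the Gaussians to draw at time t (seconds).
  generate(t: number): GaussianFrame;
}

// A static scene ignores t and always returns the same Gaussians.
function staticGenerator(frame: GaussianFrame): GaussianGenerator {
  return { generate: () => frame };
}

// A toy dynamic generator: shifts every Gaussian along x as time passes.
// A real dynamic method would run a network here instead.
function driftGenerator(base: GaussianFrame, speed: number): GaussianGenerator {
  return {
    generate: (t: number) => {
      const positions = new Float32Array(base.positions); // copy, keep base intact
      for (let i = 0; i < positions.length; i += 3) {
        positions[i] += speed * t;
      }
      return { positions, opacities: base.opacities };
    },
  };
}

// CPU reference for the sorting step: returns Gaussian indices ordered from
// farthest to nearest along viewDir (camera assumed at the origin), which is
// the order back-to-front alpha blending needs.
function sortBackToFront(
  frame: GaussianFrame,
  viewDir: [number, number, number],
): Uint32Array {
  const n = frame.positions.length / 3;
  const depth = new Float32Array(n);
  const order = new Uint32Array(n);
  for (let i = 0; i < n; i++) {
    depth[i] =
      frame.positions[3 * i] * viewDir[0] +
      frame.positions[3 * i + 1] * viewDir[1] +
      frame.positions[3 * i + 2] * viewDir[2];
    order[i] = i;
  }
  order.sort((a, b) => depth[b] - depth[a]);
  return order;
}
```

In a per-frame loop, a renderer built this way would call `generate(t)` on whichever generator is plugged in, sort the result, and draw. Swapping `staticGenerator` for `driftGenerator` changes the content without touching the sorting or drawing code, which is the kind of plug-and-play behavior the abstract attributes to the contract.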

Top-level tags: systems, computer vision, model training
Detailed tags: 3D Gaussian Splatting, neural rendering, WebGPU, world model, real-time rendering

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform


1️⃣ One-sentence summary

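This paper presents Visionary, an open, browser-based platform that uses the latest WebGPU technology and a standardized interface so that a wide range of dynamic 3D Gaussian Splatting models and generative AI algorithms can run and be displayed directly in a web page, efficiently and conveniently, greatly lowering the barrier to using and deploying these techniques.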


Source: arXiv:2512.08478