菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-05
📄 Abstract - Probabilistic Dreaming for World Models

"Dreaming" enables agents to learn from imagined experiences, enabling more robust and sample-efficient learning of world models. In this work, we consider innovations to the state-of-the-art Dreamer model using probabilistic methods that enable: (1) the parallel exploration of many latent states; and (2) maintaining distinct hypotheses for mutually exclusive futures while retaining the desirable gradient properties of continuous latents. Evaluating on the MPE SimpleTag domain, our method outperforms standard Dreamer with a 4.5% score improvement and 28% lower variance in episode returns. We also discuss limitations and directions for future work, including how optimal hyperparameters (e.g. particle count K) scale with environmental complexity, and methods to capture epistemic uncertainty in world models.

顶级标签: reinforcement learning agents model training
详细标签: world models dreamer probabilistic methods latent exploration sample efficiency 或 搜索:

用于世界模型的概率梦境方法 / Probabilistic Dreaming for World Models


1️⃣ 一句话总结

这项研究通过引入概率方法改进了先进的Dreamer模型,使其能够同时探索多种潜在状态并维持对未来不同可能性的假设,从而在虚拟环境中更稳定、高效地学习世界模型,实验证明其性能优于原版模型。

源自 arXiv: 2603.04715