探索生成流网络中的多个高分值子空间 / Exploring Multiple High-Scoring Subspaces in Generative Flow Networks
1️⃣ 一句话总结
这篇论文提出了一种名为CMAB-GFN的新方法,通过结合组合多臂老虎机来引导生成流网络的探索过程,使其能更高效地发现多种高质量的解,同时避免在低质量区域浪费资源。
As a probabilistic sampling framework, Generative Flow Networks (GFlowNets) show strong potential for constructing complex combinatorial objects through the sequential composition of elementary components. However, existing GFlowNets often suffer from excessive exploration over vast state spaces, leading to over-sampling of low-reward regions and convergence to suboptimal distributions. Effectively biasing GFlowNets toward high-reward solutions remains a non-trivial challenge. In this paper, we propose CMAB-GFN, which integrates a combinatorial multi-armed bandit (CMAB) framework with GFlowNet policies. The CMAB component prunes low-quality actions, yielding compact high-scoring subspaces for exploration. Restricting GFNs to these compact high-scoring subspaces accelerates the discovery of high-value candidates, while the exploration of different subspaces ensures that diversity is not sacrificed. Experimental results on multiple tasks demonstrate that CMAB-GFN generates higher-reward candidates than existing approaches.
探索生成流网络中的多个高分值子空间 / Exploring Multiple High-Scoring Subspaces in Generative Flow Networks
这篇论文提出了一种名为CMAB-GFN的新方法,通过结合组合多臂老虎机来引导生成流网络的探索过程,使其能更高效地发现多种高质量的解,同时避免在低质量区域浪费资源。
源自 arXiv: 2602.11491