Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits

📄 Abstract - Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits

LLM-guided evolutionary search (Evolve systems) has reached state-of-the-art results on mathematical and combinatorial tasks, yet most existing systems report only the best of many runs and leave the run-to-run distribution undocumented. We ask how a fixed budget of LLM calls should be allocated, and how reliably a single run reaches the reported numbers. Sweeping the depth-breadth grid over five models and three tasks, we identify two empirical regularities: a fitness-compute envelope along which capability ordering largely collapses on effective FLOPs, and a bilinear depth-breadth fit with task-specific interaction; both are gated by model-task capability. Motivated by these regularities, we propose BaSE (Bandit-based Self-Evolving), a multi-armed bandit that allocates LLM calls across parallel trajectories. Without changing the model, prompt, or evaluator, BaSE improves mean fitness by 12.3% over the strongest island-protocol baseline across 8 (model, task) cells, with the largest gains on high-variance settings: a reliability gain from allocation alone.

进化搜索中的计算资源分配：从深度-广度策略到多臂老虎机算法 / Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits

1️⃣ 一句话总结

本文研究了在大语言模型引导的进化搜索中，如何最优分配有限的计算资源（如LLM调用次数），发现不同任务的深度-广度配置存在规律，并据此提出一种名为BaSE的多臂老虎机算法，该算法无需修改模型或提示，仅通过智能分配计算资源，就能在多种任务上显著提升平均搜索效果和可靠性。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要