优化一切:一个用于优化任意文本参数的通用API / optimize_anything: A Universal API for Optimizing any Text Parameter
1️⃣ 一句话总结
本文提出一个基于大语言模型的通用优化系统,通过将问题转化为用评分函数评估文本改进的过程,在六个截然不同的任务(如提升AI推理准确率、降低云成本、生成高效代码)上均达到或超越专业工具的效果,证明了文本优化可以成为一种跨领域的通用问题求解范式。
Can a single LLM-based optimization system match specialized tools across fundamentally different domains? We show that when optimization problems are formulated as improving a text artifact evaluated by a scoring function, a single AI-based optimization system-supporting single-task search, multi-task search with cross-problem transfer, and generalization to unseen inputs-achieves state-of-the-art results across six diverse tasks. Our system discovers agent architectures that nearly triple Gemini Flash's ARC-AGI accuracy (32.5% to 89.5%), finds scheduling algorithms that cut cloud costs by 40%, generates CUDA kernels where 87% match or beat PyTorch, and outperforms AlphaEvolve's reported circle packing solution (n=26). Ablations across three domains reveal that actionable side information yields faster convergence and substantially higher final scores than score-only feedback, and that multi-task search outperforms independent optimization given equivalent per-problem budget through cross-task transfer, with benefits scaling with the number of related tasks. Together, we show for the first time that text optimization with LLM-based search is a general-purpose problem-solving paradigm, unifying tasks traditionally requiring domain-specific algorithms under a single framework. We open-source optimize\_anything with support for multiple backends as part of the GEPA project at this https URL .
优化一切:一个用于优化任意文本参数的通用API / optimize_anything: A Universal API for Optimizing any Text Parameter
本文提出一个基于大语言模型的通用优化系统,通过将问题转化为用评分函数评估文本改进的过程,在六个截然不同的任务(如提升AI推理准确率、降低云成本、生成高效代码)上均达到或超越专业工具的效果,证明了文本优化可以成为一种跨领域的通用问题求解范式。
源自 arXiv: 2605.19633