大型语言模型能在约束条件下进行推理和优化吗? / Can Large Language Models Reason and Optimize Under Constraints?
1️⃣ 一句话总结
这篇论文通过测试大型语言模型解决电力系统最优潮流这一复杂的约束优化问题,发现当前最先进的模型在结构化推理和约束处理方面存在显著不足,揭示了它们在应对现实世界工程优化任务时的能力缺陷。
Large Language Models (LLMs) have demonstrated great capabilities across diverse natural language tasks; yet their ability to solve abstraction and optimization problems with constraints remains scarcely explored. In this paper, we investigate whether LLMs can reason and optimize under the physical and operational constraints of Optimal Power Flow (OPF) problem. We introduce a challenging evaluation setup that requires a set of fundamental skills such as reasoning, structured input handling, arithmetic, and constrained optimization. Our evaluation reveals that SoTA LLMs fail in most of the tasks, and that reasoning LLMs still fail in the most complex settings. Our findings highlight critical gaps in LLMs' ability to handle structured reasoning under constraints, and this work provides a rigorous testing environment for developing more capable LLM assistants that can tackle real-world power grid optimization problems.
大型语言模型能在约束条件下进行推理和优化吗? / Can Large Language Models Reason and Optimize Under Constraints?
这篇论文通过测试大型语言模型解决电力系统最优潮流这一复杂的约束优化问题,发现当前最先进的模型在结构化推理和约束处理方面存在显著不足,揭示了它们在应对现实世界工程优化任务时的能力缺陷。
源自 arXiv: 2603.23004