IESR:基于高效MCTS的模块化推理方法,用于大型语言模型的文本转SQL任务 / IESR:Efficient MCTS-Based Modular Reasoning for Text-to-SQL with Large Language Models
1️⃣ 一句话总结
本文提出了一种名为IESR的高效模块化推理框架,它结合了蒙特卡洛树搜索和多数投票机制,让轻量级大语言模型无需微调就能在复杂的文本转SQL任务中取得顶尖性能,并揭示了当前模型在数学计算和常识推理上的不足。
Text-to-SQL is a key natural language processing task that maps natural language questions to SQL queries, enabling intuitive interaction with web-based databases. Although current methods perform well on benchmarks like BIRD and Spider, they struggle with complex reasoning, domain knowledge, and hypothetical queries, and remain costly in enterprise deployment. To address these issues, we propose a framework named IESR(Information Enhanced Structured Reasoning) for lightweight large language models: (i) leverages LLMs for key information understanding and schema linking, and decoupling mathematical computation and SQL generation, (ii) integrates a multi-path reasoning mechanism based on Monte Carlo Tree Search (MCTS) with majority voting, and (iii) introduces a trajectory consistency verification module with a discriminator model to ensure accuracy and consistency. Experimental results demonstrate that IESR achieves state-of-the-art performance on the complex reasoning benchmark LogicCat (24.28 EX) and the Archer dataset (37.28 EX) using only compact lightweight models without fine-tuning. Furthermore, our analysis reveals that current coder models exhibit notable biases and deficiencies in physical knowledge, mathematical computation, and common-sense reasoning, highlighting important directions for future research. We released code at this https URL.
IESR:基于高效MCTS的模块化推理方法,用于大型语言模型的文本转SQL任务 / IESR:Efficient MCTS-Based Modular Reasoning for Text-to-SQL with Large Language Models
本文提出了一种名为IESR的高效模块化推理框架,它结合了蒙特卡洛树搜索和多数投票机制,让轻量级大语言模型无需微调就能在复杂的文本转SQL任务中取得顶尖性能,并揭示了当前模型在数学计算和常识推理上的不足。
源自 arXiv: 2602.05385