超越静态流程:学习面向文本到SQL的动态工作流 / Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL
1️⃣ 一句话总结
这篇论文提出了一个名为SquRL的强化学习框架,它能让大型语言模型在文本转SQL任务中动态地选择并组合不同的处理步骤,从而比固定的静态方法更能适应复杂和陌生的查询,显著提升了实际应用中的效果。
Text-to-SQL has recently achieved impressive progress, yet remains difficult to apply effectively in real-world scenarios. This gap stems from the reliance on single static workflows, fundamentally limiting scalability to out-of-distribution and long-tail scenarios. Instead of requiring users to select suitable methods through extensive experimentation, we attempt to enable systems to adaptively construct workflows at inference time. Through theoretical and empirical analysis, we demonstrate that optimal dynamic policies consistently outperform the best static workflow, with performance gains fundamentally driven by heterogeneity across candidate workflows. Motivated by this, we propose SquRL, a reinforcement learning framework that enhances LLMs' reasoning capability in adaptive workflow construction. We design a rule-based reward function and introduce two effective training mechanisms: dynamic actor masking to encourage broader exploration, and pseudo rewards to improve training efficiency. Experiments on widely-used Text-to-SQL benchmarks demonstrate that dynamic workflow construction consistently outperforms the best static workflow methods, with especially pronounced gains on complex and out-of-distribution queries. The codes are available at this https URL
超越静态流程:学习面向文本到SQL的动态工作流 / Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL
这篇论文提出了一个名为SquRL的强化学习框架,它能让大型语言模型在文本转SQL任务中动态地选择并组合不同的处理步骤,从而比固定的静态方法更能适应复杂和陌生的查询,显著提升了实际应用中的效果。
源自 arXiv: 2602.15564