通过智能法律信息收集与评分引导优化提升判决文书生成 / Enhancing Judgment Document Generation via Agentic Legal Information Collection and Rubric-Guided Optimization
1️⃣ 一句话总结
本文提出一个名为Judge-R1的统一框架,通过引入动态规划代理自动收集准确的法律条文和判例,并结合基于强化学习的评分优化方法,让AI生成的判决文书在事实引用、法律依据和逻辑推理上更专业可靠。
Automating the drafting of judgment documents is pivotal to judicial efficiency, yet it remains challenging due to the dual requirements of comprehensive retrieval of legal information and rigorous logical reasoning. Existing approaches, typically relying on standard Retrieval-Augmented Generation and Supervised Fine-Tuning, often suffer from insufficient evidence recall, hallucinated statutory references, and logically flawed legal reasoning. To bridge this gap, we propose Judge-R1, a unified framework designed to enhance LLM-based judgment document generation by jointly improving legal information collection and judgment document generation. First, we introduce Agentic Legal Information Collection, which employs a dynamic planning agent to retrieve precise statutes and precedents from multiple sources. Second, we implement Rubric-Guided Optimization, a reinforcement learning phase utilizing Group Relative Policy Optimization (GRPO) with a comprehensive legal reward function to enforce adherence to judicial standards and reasoning logic. Extensive experiments on the JuDGE benchmark demonstrate that Judge-R1 significantly outperforms state-of-the-art baselines in both legal accuracy and generation quality.
通过智能法律信息收集与评分引导优化提升判决文书生成 / Enhancing Judgment Document Generation via Agentic Legal Information Collection and Rubric-Guided Optimization
本文提出一个名为Judge-R1的统一框架,通过引入动态规划代理自动收集准确的法律条文和判例,并结合基于强化学习的评分优化方法,让AI生成的判决文书在事实引用、法律依据和逻辑推理上更专业可靠。
源自 arXiv: 2605.02011