菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-07-02
📄 Abstract - Many Voices, One Reward: Multi-Role Rubric Generation for LLM Judging and Reward Modeling

Reliable reward and preference signals are critical for evaluating and optimizing large language models on open-ended tasks. Rubric-based judges offer a transparent way to decompose such judgments into explicit evaluation criteria, but existing annotation-free rubric generators typically rely on a single generic evaluator. As a result, they may overlook important dimensions of human preference, a failure mode we term dimensional blind spots. To address this limitation, we propose Multi-Role Rubric Generation (MRRG), a training-free and reference-free framework that elicits evaluation criteria from multiple complementary roles and consolidates them into an auditable rubric-based scorer. This scorer can be used both to validate pairwise preferences and to provide rewards for GRPO-style Reinforcement Learning with Verifiable Rewards (RLVR). Experiments on preference validation benchmarks show that MRRG consistently outperforms single-role rubric generation baselines across multiple backbone models. Further RLVR experiments demonstrate that MRRG yields a stronger reward signal for improving open-ended generation.

顶级标签: llm model evaluation
详细标签: rubric generation reward modeling preference validation reinforcement learning open-ended generation 或 搜索:

多角色评分标准生成:用于大模型评判与奖励建模的统一框架 / Many Voices, One Reward: Multi-Role Rubric Generation for LLM Judging and Reward Modeling


1️⃣ 一句话总结

本文提出一种无需训练和外部参考的框架(MRRG),通过让多个互补的角色各自生成评分标准,再整合成一份全面的评估表,从而克服单一评估角色可能忽略某些用户偏好维度的问题,既能用于验证模型答案的优劣,也能为强化学习提供更可靠的奖励信号。

源自 arXiv: 2607.01830