Many Voices, One Reward: Multi-Role Rubric Generation for LLM Judging and Reward Modeling

📄 Abstract - Many Voices, One Reward: Multi-Role Rubric Generation for LLM Judging and Reward Modeling

Reliable reward and preference signals are critical for evaluating and optimizing large language models on open-ended tasks. Rubric-based judges offer a transparent way to decompose such judgments into explicit evaluation criteria, but existing annotation-free rubric generators typically rely on a single generic evaluator. As a result, they may overlook important dimensions of human preference, a failure mode we term dimensional blind spots. To address this limitation, we propose Multi-Role Rubric Generation (MRRG), a training-free and reference-free framework that elicits evaluation criteria from multiple complementary roles and consolidates them into an auditable rubric-based scorer. This scorer can be used both to validate pairwise preferences and to provide rewards for GRPO-style Reinforcement Learning with Verifiable Rewards (RLVR). Experiments on preference validation benchmarks show that MRRG consistently outperforms single-role rubric generation baselines across multiple backbone models. Further RLVR experiments demonstrate that MRRG yields a stronger reward signal for improving open-ended generation.

多角色评分标准生成：用于大模型评判与奖励建模的统一框架 / Many Voices, One Reward: Multi-Role Rubric Generation for LLM Judging and Reward Modeling

1️⃣ 一句话总结

本文提出一种无需训练和外部参考的框架（MRRG），通过让多个互补的角色各自生成评分标准，再整合成一份全面的评估表，从而克服单一评估角色可能忽略某些用户偏好维度的问题，既能用于验证模型答案的优劣，也能为强化学习提供更可靠的奖励信号。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要