DyCoRM: Dynamic Criterion-Aware Reward Modeling for Text-to-Image Generation

📄 Abstract - DyCoRM: Dynamic Criterion-Aware Reward Modeling for Text-to-Image Generation

With the continued advancement of text-to-image (T2I) generation, producing high-quality images is becoming increasingly attainable; consequently, user demands are shifting toward images that better satisfy their specific requirements. As reward models play an increasingly important role in assessing whether generated images align with user preference, this trend introduces an important challenge for reward modeling: rather than relying solely on static and general evaluation dimensions, reward models should account for the task-relevant and fine-grained criteria through which users assess whether generated images meet their specific requirements. To address this challenge, we propose DyCoRM, a dynamic, criterion-aware reward model that grounds task-relevant criteria and performs criterion-aware preference comparison. To support this setting, we construct DyCoDataset-20K, which provides dynamic criteria together with criterion-level annotations, and further derive DyCoBench-1K, a benchmark for systematically evaluating reward models under dynamic criteria. We further introduce DyCoPick, which applies criterion-aware reward modeling to selecting T2I images. Our contributions establish the first reward modeling framework for dynamic and fine-grained evaluation and practical application in T2I generation.

DyCoRM：面向文本到图像生成的动态准则感知奖励建模 / DyCoRM: Dynamic Criterion-Aware Reward Modeling for Text-to-Image Generation

1️⃣ 一句话总结

本文提出了一种名为DyCoRM的动态奖励模型，它能够根据用户对图像的具体要求（如“风格要复古”或“颜色要鲜艳”）自动提取评价准则并进行精细打分，从而更精准地从多个生成图像中选出最符合用户个性化需求的图片。

← 返回列表

菜单

AI 帮我研读全文

1️⃣ 一句话总结

密码管理

设置密码

修改密码

移除密码

菜单

AI 帮我研读全文

1️⃣ 一句话总结

获取最新论文摘要