菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-04
📄 Abstract - Beyond Absolute Scores: Relative Edit-induced Difference for Generalizable Image Aesthetic Assessment

Traditional Image Aesthetic Assessment (IAA) methods mainly rely on regressing absolute Mean Opinion Scores (MOS). However, such a paradigm overlooks the inherently dynamic nature of human aesthetic perception, which relies on subconscious comparison against implicit visual references. Consequently, the lack of causal reasoning regarding aesthetic differences prevents models from learning generalizable aesthetic principles, thus limiting their generalization across diverse scenarios. In this work, we rethink the IAA task and propose Relative Edit-induced Difference Aesthetic learning (RED-Aes), a novel framework that leverages controllable image editing models to simulate the human aesthetic reasoning process. Instead of fitting absolute score distributions, RED-Aes explicitly learns the visual factors that drive aesthetic changes. To support this paradigm, we construct the RED-20k dataset, which comprises editing-based image pairs, quantitative aesthetic differences, and Chain-of-Thought (CoT) reasoning. Furthermore, we introduce a three-stage training strategy guided by a relative ranking consistency reward, optimizing the model solely via relative supervision. Extensive experiments demonstrate that RED-Aes achieves state-of-the-art performance on multiple public benchmarks, exhibiting superior generalization capabilities.

顶级标签: computer vision machine learning
详细标签: aesthetic assessment relative learning image editing dataset generalization 或 搜索:

超越绝对评分:基于编辑差异的相对学习实现通用图像美学评估 / Beyond Absolute Scores: Relative Edit-induced Difference for Generalizable Image Aesthetic Assessment


1️⃣ 一句话总结

本文提出一种名为RED-Aes的新方法,通过利用图像编辑工具模拟人类对比审美过程,让模型学习不同编辑操作如何改变图像美感,而非直接预测评分,从而大幅提升模型在多种场景下的泛化能力。

源自 arXiv: 2606.05778