菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-25
📄 Abstract - E-comIQ-ZH: A Human-Aligned Dataset and Benchmark for Fine-Grained Evaluation of E-commerce Posters with Chain-of-Thought

Generative AI is widely used to create commercial posters. However, rapid advances in generation have outpaced automated quality assessment. Existing models emphasize generic esthetics or low level distortions and lack the functional criteria required for e-commerce design. It is especially challenging for Chinese content, where complex characters often produce subtle but critical textual artifacts that are overlooked by existing methods. To address this, we introduce E-comIQ-ZH, a framework for evaluating Chinese e-commerce posters. We build the first dataset E-comIQ-18k to feature multi dimensional scores and expert calibrated Chain of Thought (CoT) rationales. Using this dataset, we train E-comIQ-M, a specialized evaluation model that aligns with human expert judgment. Our framework enables E-comIQ-Bench, the first automated and scalable benchmark for the generation of Chinese e-commerce posters. Extensive experiments show our E-comIQ-M aligns more closely with expert standards and enables scalable automated assessment of e-commerce posters. All datasets, models, and evaluation tools will be released to support future research in this this http URL will be available at this https URL.

顶级标签: multi-modal model evaluation aigc
详细标签: e-commerce posters quality assessment chinese content chain-of-thought benchmark 或 搜索:

E-comIQ-ZH:一个用于电商海报细粒度评估的、符合人类判断的数据集与基准框架 / E-comIQ-ZH: A Human-Aligned Dataset and Benchmark for Fine-Grained Evaluation of E-commerce Posters with Chain-of-Thought


1️⃣ 一句话总结

这篇论文针对中文电商海报质量缺乏有效自动化评估工具的问题,创建了一个包含多维评分和专家推理说明的大规模数据集,并基于此训练了一个能像人类专家一样评判海报质量的专用模型,为电商海报生成提供了首个可扩展的自动化评估基准。

源自 arXiv: 2602.21698