ServImage:来自真实世界商业影像服务的图像生成与编辑基准 / ServImage: An Image Generation and Editing Benchmark from Real-world Commercial Imaging Services
1️⃣ 一句话总结
该研究提出了一个名为ServImage的商业图像基准,通过分析超过29万美元的真实付费设计项目数据,建立了一套包含任务、评分和支付预测模型的系统,用于评估AI生成的图像是否具有实际商业价值。
Recent image generation and editing models demonstrate robust adherence to instructions and high visual quality on academic benchmarks. However, their performance on paid, real-world design projects remains uncertain. We introduce \textbf{ServImage}, a benchmark that explicitly correlates model outputs with economic value in commercial design projects. ServImage consists of (i) \textbf{\textit{ServImageBench}}: a dataset of 1.07k paid commercial design tasks and 2.05k designer deliverables totaling over \$295k, covering portrait, product, and digital content, along with 33k candidate images and 33k human annotations. (ii) \textbf{\textit{ServImageScore}}: an integrated scoring system that combines three quality dimensions: baseline requirements fulfilment, visual execution quality, and commercial necessity satisfaction. These three dimensions are designed to characterize the factors that drive human payment decisions and indicate whether an image is commercially acceptable. (iii) \textbf{\textit{ServImageModel}}: under this scoring system, we propose a payment prediction model trained on the human-annotated candidate images, achieving 82.00\% accuracy in predicting human payment decisions and producing calibrated payment probabilities. ServImage provides a comprehensive foundation for assessing the commercial viability of image generation models and offers a scalable resource for future research on economically grounded vision systems \href{this https URL}{Github.}
ServImage:来自真实世界商业影像服务的图像生成与编辑基准 / ServImage: An Image Generation and Editing Benchmark from Real-world Commercial Imaging Services
该研究提出了一个名为ServImage的商业图像基准,通过分析超过29万美元的真实付费设计项目数据,建立了一套包含任务、评分和支付预测模型的系统,用于评估AI生成的图像是否具有实际商业价值。
源自 arXiv: 2604.24023