菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-21
📄 Abstract - SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing

Traditional photographic image editing typically requires users to possess sufficient aesthetic understanding to provide appropriate instructions for adjusting image quality and camera parameters. However, this paradigm relies on explicit human instruction of aesthetic intent, which is often ambiguous, incomplete, or inaccessible to non-expert users. In this work, we propose SmartPhotoCrafter, an automatic photographic image editing method which formulates image editing as a tightly coupled reasoning-to-generation process. The proposed model first performs image quality comprehension and identifies deficiencies by the Image Critic module, and then the Photographic Artist module realizes targeted edits to enhance image appeal, eliminating the need for explicit human instructions. A multi-stage training pipeline is adopted: (i) Foundation pretraining to establish basic aesthetic understanding and editing capabilities, (ii) Adaptation with reasoning-guided multi-edit supervision to incorporate rich semantic guidance, and (iii) Coordinated reasoning-to generation reinforcement learning to jointly optimize reasoning and generation. During training, SmartPhotoCrafter emphasizes photo-realistic image generation, while supporting both image restoration and retouching tasks with consistent adherence to color- and tone-related semantics. We also construct a stage-specific dataset, which progressively builds reasoning and controllable generation, effective cross-module collaboration, and ultimately high-quality photographic enhancement. Experiments demonstrate that SmartPhotoCrafter outperforms existing generative models on the task of automatic photographic enhancement, achieving photo-realistic results while exhibiting higher tonal sensitivity to retouching instructions. Project page: this https URL.

顶级标签: computer vision model training multi-modal
详细标签: image editing reasoning-to-generation photographic enhancement reinforcement learning image quality 或 搜索:

SmartPhotoCrafter:面向自动照片图像编辑的统一推理、生成与优化 / SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing


1️⃣ 一句话总结

本文提出了一种名为SmartPhotoCrafter的自动照片编辑方法,它像一位智能摄影师,先自动分析照片的不足之处,再针对性地进行修复和美化,无需用户提供任何专业指令,就能生成媲美专业后期效果的图片。

源自 arXiv: 2604.19587