菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-15
📄 Abstract - ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control

A pictorial chart is an effective medium for visual storytelling, seamlessly integrating visual elements with data charts. However, creating such images is challenging because the flexibility of visual elements often conflicts with the rigidity of chart structures. This process thus requires a creative deformation that maintains both data faithfulness and visual aesthetics. Current methods that extract dense structural cues from natural images (e.g., edge or depth maps) are ill-suited as conditioning signals for pictorial chart generation. We present ChArtist, a domain-specific diffusion model for generating pictorial charts automatically, offering two distinct types of control: 1) spatial control that aligns well with the chart structure, and 2) subject-driven control that respects the visual characteristics of a reference image. To achieve this, we introduce a skeleton-based spatial control representation. This representation encodes only the data-encoding information of the chart, allowing for the easy incorporation of reference visuals without a rigid outline constraint. We implement our method based on the Diffusion Transformer (DiT) and leverage an adaptive position encoding mechanism to manage these two controls. We further introduce Spatially Gated Attention to modulate the interaction between spatial control and subject control. To support the fine-tuning of pre-trained models for this task, we created a large-scale dataset of 30,000 triplets (skeleton, reference image, pictorial chart). We also propose a unified data accuracy metric to evaluate the data faithfulness of the generated charts. We believe this work demonstrates that current generative models can achieve data-driven visual storytelling by moving beyond general-purpose conditions to task-specific representations. Project page: this https URL.

顶级标签: computer vision multi-modal aigc
详细标签: image generation diffusion models visual storytelling data visualization conditional generation 或 搜索:

ChArtist:通过统一的空间与主题控制生成图示化图表 / ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control


1️⃣ 一句话总结

这篇论文提出了一个名为ChArtist的专用AI模型,它能够根据用户提供的数据结构和参考图片,自动生成既准确反映数据、又具有视觉美感的图示化图表。

源自 arXiv: 2603.14209