菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-17
📄 Abstract - LICA: Layered Image Composition Annotations for Graphic Design Research

We introduce LICA (Layered Image Composition Annotations), a large-scale dataset of 1,550,244 multi-layer graphic design compositions designed to advance structured understanding and generation of graphic layouts1. In addition to ren- dered PNG images, LICA represents each design as a hierarchical composition of typed components including text, image, vector, and group elements, each paired with rich per-element metadata such as spatial geometry, typographic attributes, opacity, and visibility. The dataset spans 20 design categories and 971,850 unique templates, providing broad coverage of real-world design structures. We further introduce graphic design video as a new and largely unexplored challenge for current vision-language models through 27,261 animated layouts annotated with per-component keyframes and motion parameters. Beyond scale, LICA establishes a new paradigm of research tasks for graphic design, enabling structured investiga- tions into problems such as layer-aware inpainting, structured layout generation, controlled design editing, and temporally-aware generative modeling. By repre- senting design as a system of compositional layers and relationships, the dataset supports research on models that operate directly on design structure rather than pixels alone.

顶级标签: computer vision multi-modal data
详细标签: graphic design dataset layout generation structured understanding video generation 或 搜索:

LICA:用于平面设计研究的层级化图像合成标注数据集 / LICA: Layered Image Composition Annotations for Graphic Design Research


1️⃣ 一句话总结

这篇论文提出了一个名为LICA的大规模、结构化平面设计数据集,它不仅包含大量分层标注的设计素材,还引入了动态设计视频作为新挑战,旨在推动AI模型从单纯处理像素转向理解和生成具有层级关系的设计结构。

源自 arXiv: 2603.16098