菜单

关于 🐙 GitHub
arXiv 提交日期: 2025-12-15
📄 Abstract - ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

While existing generation and unified models excel at general image generation, they struggle with tasks requiring deep reasoning, planning, and precise data-to-visual mapping abilities beyond general scenarios. To push beyond the existing limitations, we introduce a new and challenging task: creative table visualization, requiring the model to generate an infographic that faithfully and aesthetically visualizes the data from a given table. To address this challenge, we propose ShowTable, a pipeline that synergizes MLLMs with diffusion models via a progressive self-correcting process. The MLLM acts as the central orchestrator for reasoning the visual plan and judging visual errors to provide refined instructions, the diffusion execute the commands from MLLM, achieving high-fidelity results. To support this task and our pipeline, we introduce three automated data construction pipelines for training different modules. Furthermore, we introduce TableVisBench, a new benchmark with 800 challenging instances across 5 evaluation dimensions, to assess performance on this task. Experiments demonstrate that our pipeline, instantiated with different models, significantly outperforms baselines, highlighting its effective multi-modal reasoning, generation, and error correction capabilities.

顶级标签: multi-modal model training model evaluation
详细标签: table visualization infographic generation multimodal reasoning self-correcting pipeline benchmark 或 搜索:

ShowTable:通过协同反思与精炼解锁创意表格可视化 / ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement


1️⃣ 一句话总结

这篇论文提出了一个名为ShowTable的新方法,它通过让大语言模型和扩散模型协同工作,像‘设计师’和‘画师’一样反复沟通与修正,从而自动将枯燥的表格数据转换成既准确又美观的信息图表。


源自 arXiv: 2512.13303