菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-03-25
📄 Abstract - Towards Training-Free Scene Text Editing

Scene text editing seeks to modify textual content in natural images while maintaining visual realism and semantic consistency. Existing methods often require task-specific training or paired data, limiting their scalability and adaptability. In this paper, we propose TextFlow, a training-free scene text editing framework that integrates the strengths of Attention Boost (AttnBoost) and Flow Manifold Steering (FMS) to enable flexible, high-fidelity text manipulation without additional training. Specifically, FMS preserves the structural and style consistency by modeling the visual flow of characters and background regions, while AttnBoost enhances the rendering of textual content through attention-based guidance. By jointly leveraging these complementary modules, our approach performs end-to-end text editing through semantic alignment and spatial refinement in a plug-and-play manner. Extensive experiments demonstrate that our framework achieves visual quality and text accuracy comparable to or superior to those of training-based counterparts, generalizing well across diverse scenes and languages. This study advances scene text editing toward a more efficient, generalizable, and training-free paradigm. Code is available at this https URL

顶级标签: computer vision aigc model training
详细标签: scene text editing training-free attention guidance flow manifold image manipulation 或 搜索:

迈向免训练的场景文本编辑 / Towards Training-Free Scene Text Editing


1️⃣ 一句话总结

这篇论文提出了一种名为TextFlow的免训练框架,它通过结合注意力增强和流形引导技术,无需额外训练就能在自然图像中灵活、高保真地修改文字,同时保持视觉真实感和风格一致性。

源自 arXiv: 2603.24571