菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-02-03
📄 Abstract - Symbol-Aware Reasoning with Masked Discrete Diffusion for Handwritten Mathematical Expression Recognition

Handwritten Mathematical Expression Recognition (HMER) requires reasoning over diverse symbols and 2D structural layouts, yet autoregressive models struggle with exposure bias and syntactic inconsistency. We present a discrete diffusion framework that reformulates HMER as iterative symbolic refinement instead of sequential generation. Through multi-step remasking, the proposal progressively refines both symbols and structural relations, removing causal dependencies and improving structural consistency. A symbol-aware tokenization and Random-Masking Mutual Learning further enhance syntactic alignment and robustness to handwriting diversity. On the MathWriting benchmark, the proposal achieves 5.56\% CER and 60.42\% EM, outperforming strong Transformer and commercial baselines. Consistent gains on CROHME 2014--2023 demonstrate that discrete diffusion provides a new paradigm for structure-aware visual recognition beyond generative modeling.

顶级标签: computer vision natural language processing multi-modal
详细标签: handwritten recognition mathematical expressions discrete diffusion symbolic reasoning visual recognition 或 搜索:

基于符号感知推理与掩码离散扩散的手写数学表达式识别 / Symbol-Aware Reasoning with Masked Discrete Diffusion for Handwritten Mathematical Expression Recognition


1️⃣ 一句话总结

这篇论文提出了一种新的非自回归方法,通过离散扩散模型逐步优化符号和结构,有效解决了手写数学表达式识别中因顺序生成导致的错误累积和结构不一致问题,显著提升了识别准确率。

源自 arXiv: 2602.03370