FlowExtract:从维护流程图中提取程序性知识 / FlowExtract: Procedural Knowledge Extraction from Maintenance Flowcharts
1️⃣ 一句话总结
这篇论文提出了一个名为FlowExtract的系统,它能够自动从工业维护流程图中提取出结构化的、可查询的程序步骤关系图,解决了现有视觉模型难以理解此类图表连接关系的难题。
Maintenance procedures in manufacturing facilities are often documented as flowcharts in static PDFs or scanned images. They encode procedural knowledge essential for asset lifecycle management, yet inaccessible to modern operator support systems. Vision-language models, the dominant paradigm for image understanding, struggle to reconstruct connection topology from such diagrams. We present FlowExtract, a pipeline for extracting directed graphs from ISO 5807-standardized flowcharts. The system separates element detection from connectivity reconstruction, using YOLOv8 and EasyOCR for standard domain-aligned node detection and text extraction, combined with a novel edge detection method that analyzes arrowhead orientations and traces connecting lines backward to source nodes. Evaluated on industrial troubleshooting guides, FlowExtract achieves very high node detection and substantially outperforms vision-language model baselines on edge extraction, offering organizations a practical path toward queryable procedural knowledge representations. The implementation is available athttps://github.com/guille-gil/FlowExtract.
FlowExtract:从维护流程图中提取程序性知识 / FlowExtract: Procedural Knowledge Extraction from Maintenance Flowcharts
这篇论文提出了一个名为FlowExtract的系统,它能够自动从工业维护流程图中提取出结构化的、可查询的程序步骤关系图,解决了现有视觉模型难以理解此类图表连接关系的难题。
源自 arXiv: 2604.06770