菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-06-21
📄 Abstract - Automated sign detection across the Electronic Babylonian Library: A large-scale dataset and end-to-end cuneiform OCR pipeline

Learning to read cuneiform tablets is an extremely demanding task; consequently, of the roughly half million excavated tablets, only a small fraction has been analysed by Assyriologists. Computer vision offers a promising avenue for decipherment but requires large, densely annotated datasets. To address this limitation, the largest annotated cuneiform sign dataset to date is used, and a Deformable Detection Transformer (DETR)-based object detection model is evaluated under two class granularities of 173 and 106 classes. The proposed system integrates automatic tablet-side extraction, heuristic line grouping, and n-gram-based textual similarity evaluation to bridge visual sign detection and textual structure, and achieves consistent improvements of up to 28-37% over prior work on COCO-style detection metrics. At inference, the method is applied to 87,668 tablet fragments from the Electronic Babylonian Library (eBL) corpus, producing nearly 2.9 million sign detections. Although the approach operates without linguistic priors and remains sensitive to tablet damage and layout variability, it provides a scalable and interpretable foundation for corpus-wide cuneiform analysis and supports future integration with multimodal and linguistic modelling frameworks.

顶级标签: computer vision data multi-modal
详细标签: object detection cuneiform analysis benchmark ocrcuneiform transformer 或 搜索:

电子巴比伦图书馆中的自动楔形文字符号检测:大规模数据集与端到端OCR流水线 / Automated sign detection across the Electronic Babylonian Library: A large-scale dataset and end-to-end cuneiform OCR pipeline


1️⃣ 一句话总结

本文构建了目前规模最大的楔形文字符号标注数据集,并开发了一套基于变形检测Transformer(DETR)的自动识别流水线,能从数十万块陶板碎片中快速、准确地检测出近290万个楔形文字符号,为亚述学研究和跨学科文献分析提供了高效工具。

源自 arXiv: 2606.22608