菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-05-18
📄 Abstract - A Dataset for the Recognition of Historical and Handwritten Music Scores in Western Notation

A large amount of musical heritage has been digitised by memory institutions: libraries, museums, and archives. Nevertheless, the field of Optical Music Recognition (OMR) has struggled with making this music machine-readable, despite advances in deep learning, mostly because no datasets for training systems in realistic conditions were available. The MusiCorpus dataset aims to remedy this situation by providing 1,309 pages of historical sheet music, primarily handwritten, with MusicXML transcriptions and symbol annotations. It is the largest dataset of handwritten music to date and the first dataset containing a realistic and representative sample of musical document collections from memory institutions, suitable for training and evaluating both end-to-end and object detection-based OMR systems and comparing their performance.

顶级标签: computer vision data machine learning
详细标签: optical music recognition historical manuscripts handwritten music dataset symbol annotations 或 搜索:

用于识别西方记谱法历史手写乐谱的数据集 / A Dataset for the Recognition of Historical and Handwritten Music Scores in Western Notation


1️⃣ 一句话总结

该论文介绍了MusiCorpus数据集,包含1309页历史手写乐谱及对应的机器可读标注,是目前最大的手写乐谱数据集,旨在解决光学音乐识别领域因缺乏真实场景训练数据而难以自动识别馆藏乐谱的难题。

源自 arXiv: 2605.18436