Real-Time Music Following in Score Sheet Images via Multi-Resolution Prediction

Henkel, Florian and Widmer, Gerhard (2021) Real-Time Music Following in Score Sheet Images via Multi-Resolution Prediction. Frontiers in Computer Science, 3. ISSN 2624-9898

[thumbnail of pubmed-zip/versions/1/package-entries/fcomp-03-718340/fcomp-03-718340.pdf] Text
pubmed-zip/versions/1/package-entries/fcomp-03-718340/fcomp-03-718340.pdf - Published Version

Download (2MB)

Abstract

The task of real-time alignment between a music performance and the corresponding score (sheet music), also known as score following, poses a challenging multi-modal machine learning problem. Training a system that can solve this task robustly with live audio and real sheet music (i.e., scans or score images) requires precise ground truth alignments between audio and note-coordinate positions in the score sheet images. However, these kinds of annotations are difficult and costly to obtain, which is why research in this area mainly utilizes synthetic audio and sheet images to train and evaluate score following systems. In this work, we propose a method that does not solely rely on note alignments but is additionally capable of leveraging data with annotations of lower granularity, such as bar or score system alignments. This allows us to use a large collection of real-world piano performance recordings coarsely aligned to scanned score sheet images and, as a consequence, improve over current state-of-the-art approaches.

Item Type: Article
Subjects: Opene Prints > Computer Science
Depositing User: Managing Editor
Date Deposited: 10 Feb 2023 07:14
Last Modified: 15 Sep 2023 04:47
URI: http://geographical.go2journals.com/id/eprint/210

Actions (login required)

View Item
View Item