Home > Published Issues > 2024 > Volume 12, No. 1, 2024 >
JOIG 2024 Vol.12(1): 32-39
doi: 10.18178/joig.12.1.32-39

Musical Note Position and Duration Recognition Model in Optical Music Recognition Using Convolutional Neural Network

Douglas Rakasiwi Nugroho * and Amalia Zahra
Department of Computer Science, Bina Nusantara University, Jakarta, Indonesia
Email: douglas.nugroho@binus.ac.id (D.R.N.); amalia.zahra@binus.edu (A.Z.)
*Corresponding author

Manuscript received June 2, 2023; revised June 27, 2023; accepted October 3, 2023; published February 6, 2024.

Abstract—This study aims to solve Optical Music Recognition (OMR) problems using a non-End-to-End (non-E2E) approach. Therefore, separate models for Position Recognition (PR) and Duration Recognition (DR) are constructed, with both employing Convolutional Neural Networks (CNN). In terms of constructing a non-E2E architecture to solve OMR problems, this study obtains superior evaluation results compared to previous research, with the PR and DR models achieving accuracies of 97.88% and 99.23%, respectively. In addition, this study employs template matching in conjunction with several supplementary tasks to identify the positions of musical notes and generate the corresponding note sequences in the intended reading format. Our Optical Music Recognition (OMR) system can accomplish comparable results to the E2E architecture by utilizing these techniques.

Keywords—non-end-to-end optical music recognition, convolutional neural network, position recognition model, duration recognition model, template matching

Cite: Douglas Rakasiwi Nugroho and Amalia Zahra, "Musical Note Position and Duration Recognition Model in Optical Music Recognition Using Convolutional Neural Network," Journal of Image and Graphics, Vol. 12, No. 1, pp. 32-39, 2024.

Copyright © 2024 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.