AI RESEARCH

From Image to Music Language: A Two-Stage Structure Decoding Approach for Complex Polyphonic OMR

arXiv CS.CV

ArXi:2604.20522v1 Announce Type: cross We propose a new approach for the second stage of a practical two-stage Optical Music Recognition (OMR) pipeline. Given symbol and event candidates from the visual pipeline, we decode them into an editable, verifiable, and exportable score structure. We focus on complex polyphonic staff notation, especially piano scores, where voice separation and intra-measure timing are the main bottlenecks.