AI RESEARCH
PianoCoRe: Combined and Refined Piano MIDI Dataset
arXiv CS.LG
•
ArXi:2605.06627v1 Announce Type: cross Symbolic music datasets with matched scores and performances are essential for many music information retrieval (MIR) tasks. Yet, existing resources often cover a narrow range of composers, lack performance variety, omit note-level alignments, or use inconsistent naming formats. This work presents PianoCoRe, a large-scale piano MIDI dataset that unifies and refines major open-source piano corpora. The dataset contains 250,046 performances of 5,625 pieces.