AI RESEARCH

A large-scale heterogeneous 3D magnetic resonance brain imaging dataset for self-supervised learning

arXiv CS.CV

ArXi:2506.14432v3 Announce Type: replace-cross We present FOMO260K, a large-scale, heterogeneous dataset of 260,927 brain Magnetic Resonance Imaging (MRI) scans from 77,589 MRI sessions and 55,378 subjects, aggregated from 910 publicly available sources. The dataset includes both clinical- and research-grade images, multiple MRI sequences, and a wide range of anatomical and pathological variability, including scans with large brain anomalies. Minimal preprocessing was applied to preserve the original image characteristics while reducing entry barriers for new users.