AI RESEARCH
100,000+ Movie Reviews from Kazakhstan: Russian, Kazakh, and Code-Switched Texts
arXiv CS.CL
•
ArXi:2605.08600v2 Announce Type: replace We present a new publicly available corpus of 100,502 movie reviews from Kazakhstan collected from kino.kz, spanning 2001-2025 and covering 4,943 unique titles. The dataset is multilingual, consisting mainly of Russian reviews alongside Kazakh and code-switched texts. Reviews are manually annotated for language and sentiment polarity, and 11,309 reviews additionally contain explicit user-provided ratings.