Detecting Basic Values in A Noisy Russian Social Media Text Data: A Multi-Stage Classification Framework

ArXi:2603.18822v1 Announce Type: new This study presents a multi-stage classification framework for detecting human values in noisy Russian language social media, validated on a random sample of 7.5M public text posts. Drawing on Schwartz's theory of basic human values, we design a multi-stage pipeline that includes spam and nonpersonal content filtering, targeted selection of value relevant and politically relevant posts, LLM based annotation, and multi-label classification.