AI RESEARCH

SemEval-2026 Task 7: Everyday Knowledge Across Diverse Languages and Cultures

arXiv CS.CL

ArXi:2605.02601v1 Announce Type: new We present our shared task on evaluating the adaptability of LLMs and NLP systems across multiple languages and cultures. The task data consist of an extended version of our manually constructed BLEnD benchmark (Myung 2024), covering than 30 language-culture pairs, predominantly representing low-resource languages spoken across multiple continents. As the task is designed strictly for evaluation, participants were not permitted to use the data for