AI RESEARCH
A survey of diversity quantification in natural language processing: The why, what, where and how
arXiv CS.CL
•
ArXi:2507.20858v3 Announce Type: replace The concept of diversity has received increasing attention in natural language processing (NLP) in recent years. It became an advocated property of datasets and systems, and many measures are used to quantify it. However, it is often addressed in an ad hoc manner, with few explicit justifications of its endorsement and many cross-paper inconsistencies. There have been very few attempts to take a step back and understand the conceptualization of diversity in