AI RESEARCH

GKnow: Measuring the Entanglement of Gender Bias and Factual Gender

arXiv CS.CL

ArXi:2605.12299v1 Announce Type: new Recent works have analyzed the impact of individual components of neural networks on gendered predictions, often with a focus on mitigating gender bias. However, mechanistic interpretations of gender tend to (i) focus on a very specific gender-related task, such as gendered pronoun prediction, or (ii) fail to distinguish between the production of factually gendered outputs (the correct assumption of gender given a word that carries gender as a semantic property) and gender biased outputs (based on a stereotype