AI RESEARCH

Truth or Tribe: How In-group Favoritism Prioritize Facts in Persona Agents

arXiv CS.AI

ArXi:2605.01329v1 Announce Type: new In-group favoritism refers to the phenomena of favoring members of one's in-group over out-group members and is widely observed in numerous social cooperative behaviors. Recently, in-group favoritism biases have also been identified in generative language models. However, whether the in-group favoritism exists when persona agents are faced with contradicting information (e.g., misinformation), and how to mitigate the adverse effects of in-group favoritism biases in persona agents have been understudied.