AI RESEARCH
To Whom Do Language Models Align? Measuring Principal Hierarchies Under High-Stakes Competing Demands
arXiv CS.AI
arXiv:2605.12120v1 Announce Type: new Language models deployed in high-stakes professional settings face conflicting demands from users, institutional authorities, and professional norms. How models act when these demands conflict reveals a principal hierarchy -- an implicit ordering over competing stakeholders that determines, for instance, whether a medical AI receiving a cost-reduction directive from a hospital administrator complies at the expense of evidence-based care, or refuses because professional standards require it.