Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models

arXiv:2603.20162v1 Announce Type: new In contested domains, instruction-tuned language models must balance user-alignment pressures against faithfulness to the in-context evidence. To evaluate this tension, we