AI RESEARCH

Model Organisms Are Leaky: Perplexity Differencing Often Reveals Finetuning Objectives

arXiv cs.CL

arXiv:2605.00994v1 Announce Type: new Finetuning can significantly modify the behavior of large language models, including