Iterative Finetuning is Mostly Idempotent

ArXi:2605.01130v1 Announce Type: new If a model has some behavioral tendency, such as sycophancy or misalignment, and it is trained on its own outputs, will the tendency be amplified in the next generation of models? We study this question by