AI RESEARCH

Demystifying Mergeability: Interpretable Properties to Predict Model Merging Success

arXiv CS.LG

ArXi:2601.22285v4 Announce Type: replace Model merging combines knowledge from separately fine-tuned models, yet success factors remain poorly understood. While recent work treats mergeability as an intrinsic property, we show with an architecture-agnostic framework that it fundamentally depends on both the merging method and the partner tasks. Using linear optimization over a set of interpretable pairwise metrics (e.g., gradient L2 distance), we uncover properties correlating with post-merge performance across four merging methods.