AI RESEARCH

Model Merging: Foundations and Algorithms

arXiv CS.LG

ArXi:2605.01580v1 Announce Type: new Modern deep learning usually treats models as separate artifacts: trained independently, specialized for particular purposes, and replaced when improved versions appear. This thesis studies model merging as an alternative paradigm: combining independently trained neural networks directly in weight space, with little or no optimization and without requiring access to the original