AI RESEARCH
ES-Merging: Biological MLLM Merging via Embedding Space Signals
arXiv CS.AI
•
ArXi:2603.14405v1 Announce Type: cross Biological multimodal large language models (MLLMs) have emerged as powerful foundation models for scientific discovery. However, existing models are specialized to a single modality, limiting their ability to solve inherently cross-modal scientific problems. While model merging is an efficient method to combine the different modalities into a unified MLLM, existing methods rely on input-agnostic parameter space heuristics that fail to faithfully capture modality specialization.