AI RESEARCH
MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals
arXiv CS.CV
•
ArXi:2603.08174v1 Announce Type: new The paradigm of Multimodal Large Language Models (MLLMs) offers a promising blueprint for advancing the electromagnetic (EM) domain. However, prevailing approaches often deviate from the native MLLM paradigm, instead using task-specific or pipelined architectures that lead to fundamental limitations in model performance and generalization. Fully realizing the MLLM potential in EM domain requires overcoming three main challenges: (1) Data. The scarcity of high-quality datasets with paired EM signals and descriptive text annotations used for MLLMs pre-