MARS: Sound Generation via Multi-Channel Autoregression on Spectrograms

arXiv cs.AI

arXiv:2509.26007v2 Announce Type: replace-cross

Research on audio generation has progressively developed along both waveform-based and spectrogram-based directions, giving rise to diverse strategies for representing and generating audio. At the same time, advances in image synthesis have shown that autoregression across scales, rather than across tokens, improves coherence and detail. Building on these ideas, we