Qwen3.5 Omni - Qwen’s latest generation of fully omnimodal LLM

r/singularity
Generative AI Open Source AI

Qwen3.5-Omni is Qwen’s latest generation of fully omnimodal LLM, ing the understanding of text, images, audio, and audio-visual content. Both the Thinker and Talker in Qwen3.5-Omni adopt the Hybrid-Attention MoE. Qwen3.5-Omni series includes Instruct versions in three sizes: Plus, Flash, and Light, with for 256k long-context input. The model can process than 10 hours of audio input and over 400 seconds of 720P audio-visual input at 1