Alibaba Qwen Team Releases Qwen3.5 Omni: A Native Multimodal Model for Text, Audio, Video, and Realtime Interaction
The landscape of multimodal large language models (MLLMs) has shifted from experimental ‘wrappers’, where separate vision or audio encoders are stitched onto a text-based backbone, to native, end-to-end ‘omnimodal’ architectures. The Alibaba Qwen team's latest release, Qwen3.5-Omni, represents a significant milestone in this evolution. Designed as a direct competitor to flagship models like Gemini 3.1…
