Alibaba’s Qwen AI Releases Compact Dense Qwen3-VL 4B/8B (Instruct & Thinking) With FP8 Checkpoints
Do you really need a large VLM when the dense Qwen3-VL 4B/8B models (Instruct/Thinking) with FP8 run in low VRAM yet retain the 256K→1M context and the full feature set? Alibaba’s Qwen team has expanded its multimodal lineup with dense Qwen3-VL models at 4B and 8B scales, each available in two task profiles (Instruct and Thinking), plus FP8-quantized checkpoints for…