Google DeepMind Releases Gemma 4 12B: An Encoder-Free Multimodal Model with Native audio that runs on a 16 GB laptop
Google DeepMind simply launched (*16*), a dense multimodal mannequin that strips out conventional encoders completely. Vision and audio circulate straight into the LLM spine. The result’s a mannequin that runs agentic workflows on a shopper laptop with 16 GB of RAM. It ships beneath the Apache 2.0 license. Model Overview & Access Gemma 4 12B…
