Hugging Face Releases Smol2Operator: A Fully Open-Source Pipeline to Train a 2.2B VLM into an Agentic GUI Coder
Hugging Face (HF) has released Smol2Operator, a reproducible, end-to-end recipe that turns a small vision-language model (VLM) with no prior UI grounding into a GUI-operating, tool-using agent. The release covers data transformation utilities, training scripts, transformed datasets, and the resulting 2.2B-parameter model checkpoint, positioned as a complete blueprint for building GUI agents from scratch rather than…
