StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio
The StepFun AI group has launched Step-Audio 2 Mini, an 8B parameter speech-to-speech giant audio language mannequin (LALM) that delivers expressive, grounded, and real-time audio interplay. Launched beneath the Apache 2.0 license, this open-source mannequin achieves state-of-the-art efficiency throughout speech recognition, audio understanding, and speech dialog benchmarks—surpassing business techniques similar to GPT-4o-Audio. https://huggingface.co/stepfun-ai/Step-Audio-2-mini Key Options…
