Zyphra Releases ZAYA1-8B-Diffusion-Preview: The First MoE Diffusion Model Converted From an Autoregressive LLM With Up to 7.7x Speedup
Zyphra, the San Francisco-based AI lab behind the ZAYA1 model family, has released ZAYA1-8B-Diffusion-Preview, a preview of its early work on diffusion language models. The release demonstrates that an existing autoregressive language model can be converted into a discrete diffusion model with no systematic loss of evaluation performance, while delivering substantial inference speedups on AMD…
