Tilde Research Introduces Aurora: A Leverage-Aware Optimizer That Fixes a Hidden Neuron Death Problem in Muon
Researchers at Tilde Research have released Aurora, a new optimizer for training neural networks that addresses a structural flaw in the widely used Muon optimizer. The flaw quietly kills off a significant fraction of MLP neurons during training and keeps them permanently dead. Aurora comes with a 1.1B-parameter pretraining experiment, a new state-of-the-art result…
