Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR
by [unverified] · free · Last verified 2026-06-21T03:07:54.160Z
Muon is a matrix-aware optimizer that leverages Newton-Schulz (NS) iterations to enforce spectral gradient orthogonalization by driving all singular values of the momentum matrix toward 1. While this uniform spectral whitening enhances exploration and outperforms AdamW in LLM pretraining, we show...
https://huggingface.co/papers/2605.19282
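The orthogonalization step the summary describes can be illustrated with a minimal sketch. It assumes the classic cubic Newton-Schulz iteration, X <- 1.5 X - 0.5 X Xᵀ X, which is not necessarily the tuned polynomial used by Muon or by the paper. The helper names (`matmul`, `transpose`, `newton_schulz`) and the example matrix are hypothetical, chosen only for illustration.

```python
# Illustrative sketch only: classic cubic Newton-Schulz iteration in pure Python.
# Assumption: this is NOT the exact polynomial or step count used by Muon or the paper.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]


def newton_schulz(m, steps=30):
    """Drive every singular value of a full-rank matrix m toward 1."""
    # Scale by the Frobenius norm so all singular values start in (0, 1],
    # which lies inside the convergence region of the cubic iteration.
    fro = sum(v * v for row in m for v in row) ** 0.5
    x = [[v / fro for v in row] for row in m]
    for _ in range(steps):
        # X <- 1.5 * X - 0.5 * (X X^T) X maps each singular value s
        # to 1.5 * s - 0.5 * s**3, whose attracting fixed point is s = 1.
        xxtx = matmul(matmul(x, transpose(x)), x)
        x = [[1.5 * a - 0.5 * b for a, b in zip(ra, rb)] for ra, rb in zip(x, xxtx)]
    return x


# Hypothetical 2x2 "momentum" matrix with unequal singular values.
momentum = [[3.0, 1.0], [0.0, 2.0]]
q = newton_schulz(momentum)

# If all singular values reached 1, q is orthogonal and q^T q is the identity.
gram = matmul(transpose(q), q)
```

After the loop, `gram` should be numerically close to the identity matrix, which is the "uniform spectral whitening" the summary refers to: the directions of the update are kept while its singular values are all set to roughly 1.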
Grade: F (Critical)
Adoption: F · Quality: F · Freshness: A+ · Citations: F · Engagement: F
Specifications
- Pricing: free
- Capabilities: (none listed)
- Integrations: (none listed)
- Use Cases: (none listed)
- API Available: No
- Tags: auto-discovered
- Added: 2026-06-21T03:07:54.160Z
- Completeness: 0%
Index Score: 0
- Adoption: 0
- Quality: 0
- Freshness: 100
- Citations: 0
- Engagement: 0