Researcher Keller Jordan has managed to join OpenAI based on a single blog post about the Muon optimizer, which may be being used for GPT-5 training; Muon is an optimizer for the hidden layers of neural networks that uses Newton-Schultz iteration to achieve update matrix orthogonalization, and is faster to train than AdamW; Keller has criticized the research literature on optimizers for being full of methods that have failed to be adopted methods and advocates validating the effectiveness of new methods in competitive training tasks.
❯
Search
Scan to open current page
Top
Checking in, please wait
Click for today's check-in bonus!
You have earned {{mission.data.mission.credit}} points today!
My Coupons
-
¥CouponsLimitation of useExpired and UnavailableLimitation of use
before
Limitation of usePermanently validCoupon ID:×Available for the following products: Available for the following products categories: Unrestricted use:Available for all products and product types
No coupons available!
Unverify
Daily tasks completed:
