Linear Layer
Subscribe
Sign in
Understanding neural network optimizers…
Julian Lehrer
Jul 1
1
1
how and why we build algorithms like AdamW and Muon
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Understanding neural network optimizers…
how and why we build algorithms like AdamW and Muon