XuezheMax / apolloLinks
Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization
☆182Updated 3 years ago
Alternatives and similar repositories for apollo
Users that are interested in apollo are comparing it to the libraries listed below
Sorting:
- pytorch implement of Lookahead Optimizer☆195Updated 3 years ago
- [ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.☆166Updated 4 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"