ag14774 / diffdistLinks
☆62Updated 5 years ago
Alternatives and similar repositories for diffdist
Users that are interested in diffdist are comparing it to the libraries listed below
Sorting:
- "Layer-wise Adaptive Rate Scaling" in PyTorch☆87Updated 5 years ago
- An implementation of shampoo☆78Updated 7 years ago
- On Network Design Spaces for Visual Recognition☆96Updated 5 years ago
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 6 years ago
- A Re-implementation of Fixed-update Initialization