michaelrzhang / lookaheadView external linksLinks
Implementation for the Lookahead Optimizer.
☆244Apr 29, 2022Updated 3 years ago
Alternatives and similar repositories for lookahead
Users that are interested in lookahead are comparing it to the libraries listed below
Sorting:
- Over9000 optimizer☆424Nov 22, 2022Updated 3 years ago
- lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch☆338Aug 9, 2019Updated 6 years ago
- Tensorflow Optimizers☆11Sep 1, 2019Updated 6 years ago
- On the Variance of the Adaptive Learning Rate and Beyond☆2,549Jul 31, 2021Updated 4 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆17Jan 5, 2021Updated 5 years ago
- Unofficial PyTorch Implementation of EvoNorm☆123Aug 29, 2021Updated 4 years ago
- Hypergradient descent☆147May 31, 2024Updated last year
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆36Jun 25, 2020Updated 5 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,161Mar 22, 2024Updated last year
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,208Dec 22, 2023Updated 2 years ago
- pytorch implement of Lookahead Optimizer☆195Jun 20, 2022Updated 3 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆283Oct 24, 2022Updated 3 years ago
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)☆415Jan 13, 2021Updated 5 years ago
- Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"☆1,068Aug 9, 2024Updated last year
- Configure Python functions explicitly and safely☆128Nov 18, 2024Updated last year
- Official adversarial mixup resynthesis repository☆35Feb 14, 2020Updated 6 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆94Dec 16, 2020Updated 5 years ago
- Code release for paper "Random Search and Reproducibility for NAS"☆167Jul 1, 2019Updated 6 years ago
- ☆14Sep 4, 2020Updated 5 years ago
- Source code accompanying our CVPR 2019 paper: "NetTailor: Tuning the architecture, not just the weights."☆54Aug 14, 2021Updated 4 years ago
- higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr…☆1,627Mar 25, 2022Updated 3 years ago
- Pytorch optimizers implementing Hilbert Constrained Gradient Descent☆19May 9, 2019Updated 6 years ago
- An optimizer that trains as fast as Adam and as good as SGD.☆2,913Jul 23, 2023Updated 2 years ago
- Twin Auxiliary Classifiers GAN (NeurIPS 2019) [Spotlight]☆15Sep 19, 2019Updated 6 years ago
- Code for "Self-Distillation as Instance-Specific Label Smoothing"☆16Oct 22, 2020Updated 5 years ago
- ☆31Mar 29, 2018Updated 7 years ago
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.☆27Aug 30, 2022Updated 3 years ago
- batchboost is a variation on MixUp that instead of mixing just two images, mixes many images together.☆44Jan 26, 2020Updated 6 years ago
- A LARS implementation in PyTorch☆353Feb 21, 2020Updated 5 years ago
- Code for Paper ''Dual Student: Breaking the Limits of the Teacher in Semi-Supervised Learning'' [ICCV 2019]☆119Aug 20, 2020Updated 5 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆67Jan 10, 2023Updated 3 years ago
- Codebase for Image Classification Research, written in PyTorch.☆2,169Mar 20, 2024Updated last year
- PyTorch layer-by-layer model profiler☆607May 23, 2021Updated 4 years ago
- Official Implementation of 'Fast AutoAugment' in PyTorch.☆1,614Jun 16, 2021Updated 4 years ago
- Stochastic Weight Averaging in PyTorch☆977Aug 1, 2021Updated 4 years ago
- Code for ICLR2018 paper: Improving GAN Training via Binarized Representation Entropy (BRE) Regularization - Y. Cao · W Ding · Y.C. Lui · …☆20Jun 12, 2018Updated 7 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Jan 12, 2019Updated 7 years ago
- Implementations of ideas from recent papers☆392Dec 22, 2020Updated 5 years ago
- ☆38Nov 13, 2020Updated 5 years ago