Implementation for the Lookahead Optimizer.
☆246Apr 29, 2022Updated 3 years ago
Alternatives and similar repositories for lookahead
Users that are interested in lookahead are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Over9000 optimizer☆424Nov 22, 2022Updated 3 years ago
- lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch☆338Aug 9, 2019Updated 6 years ago
- Tensorflow Optimizers☆11Sep 1, 2019Updated 6 years ago
- On the Variance of the Adaptive Learning Rate and Beyond☆2,550Jul 31, 2021Updated 4 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,170Mar 22, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- pytorch implement of Lookahead Optimizer☆195Jun 20, 2022Updated 3 years ago
- Unofficial PyTorch Implementation of EvoNorm☆123Aug 29, 2021Updated 4 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Jan 5, 2021Updated 5 years ago
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)☆415Jan 13, 2021Updated 5 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆36Jun 25, 2020Updated 5 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,206Dec 22, 2023Updated 2 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆285Oct 24, 2022Updated 3 years ago
- Lookahead optimizer ("Lookahead Optimizer: k steps forward, 1 step back") for tensorflow☆25Sep 3, 2019Updated 6 years ago
- ☆13Jul 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"☆1,071Aug 9, 2024Updated last year
- This is the code associated with the paper A Variational Inequality Perspective for Generative Adversarial Networks.☆43May 1, 2019Updated 6 years ago
- Configure Python functions explicitly and safely☆129Nov 18, 2024Updated last year
- Code release for paper "Random Search and Reproducibility for NAS"☆167Jul 1, 2019Updated 6 years ago
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆16Feb 12, 2026Updated last month
- ☆14Sep 4, 2020Updated 5 years ago
- Code for "Self-Distillation as Instance-Specific Label Smoothing"☆15Oct 22, 2020Updated 5 years ago
- Official adversarial mixup resynthesis repository☆35Feb 14, 2020Updated 6 years ago
- A LARS implementation in PyTorch☆353Feb 21, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr…☆1,627Mar 25, 2022Updated 4 years ago
- An optimizer that trains as fast as Adam and as good as SGD.☆2,907Jul 23, 2023Updated 2 years ago
- Code for our ICLR'2021 paper "DrNAS: Dirichlet Neural Architecture Search"☆43Apr 12, 2021Updated 4 years ago
- Codebase for Image Classification Research, written in PyTorch.☆2,167Mar 20, 2024Updated 2 years ago
- Source code accompanying our CVPR 2019 paper: "NetTailor: Tuning the architecture, not just the weights."☆53Aug 14, 2021Updated 4 years ago
- Implementation of the Functional Neural Process models☆42Jul 17, 2020Updated 5 years ago
- Pytorch optimizers implementing Hilbert Constrained Gradient Descent☆19May 9, 2019Updated 6 years ago
- Implementations of ideas from recent papers☆391Dec 22, 2020Updated 5 years ago
- 16th Place Solution for Google QUEST QA Labeling on Kaggle☆10Feb 28, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Self-Similarity Priors: Neural Collages as Differentiable Fractal Representations☆29Nov 26, 2022Updated 3 years ago
- ☆38Nov 13, 2020Updated 5 years ago
- batchboost is a variation on MixUp that instead of mixing just two images, mixes many images together.☆44Jan 26, 2020Updated 6 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆93Dec 16, 2020Updated 5 years ago
- Lookahead mechanism for optimizers in Keras.☆50Jun 24, 2021Updated 4 years ago
- Stochastic Weight Averaging in PyTorch☆979Aug 1, 2021Updated 4 years ago
- Code for Paper ''Dual Student: Breaking the Limits of the Teacher in Semi-Supervised Learning'' [ICCV 2019]☆119Aug 20, 2020Updated 5 years ago