alphadl / lookahead.pytorchLinks

lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch

☆337

Alternatives and similar repositories for lookahead.pytorch

Users that are interested in lookahead.pytorch are comparing it to the libraries listed below

Sorting:

lonePatient / lookahead_pytorch
pytorch implement of Lookahead Optimizer
☆192Updated 3 years ago
Yonghongwei / Gradient-Centralization
A New Optimization Technique for Deep Neural Networks
☆537Updated 3 years ago
majumderb / rezero
Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"
☆410Updated last year
timgaripov / swa
Stochastic Weight Averaging in PyTorch
☆974Updated 4 years ago
PistonY / torch-toolbox
🛠 Toolbox to extend PyTorch functionalities
☆421Updated last year
mpyrozhok / adamwr
Implements https://arxiv.org/abs/1711.05101 AdamW optimizer, cosine learning rate scheduler and "Cyclical Learning Rates for Training Neu…
☆150Updated 6 years ago
zsef123 / EfficientNets-PyTorch
A PyTorch implementation of " EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks."
☆313Updated 5 years ago
egg-west / AdamW-pytorch
Implementation and experiments for AdamW on Pytorch
☆94Updated 5 years ago
nmhkahn / torchsummaryX
torchsummaryX: Improved visualization tool of torchsummary
☆303Updated 3 years ago
hongyi-zhang / mixup
Implementation of the mixup training method
☆466Updated 7 years ago
karanchahal / distiller
A large scale study of Knowledge Distillation.
☆220Updated 5 years ago
pytorch / contrib
Implementations of ideas from recent papers
☆392Updated 4 years ago
4uiiurz1 / pytorch-auto-augment
PyTorch implementation of AutoAugment.
☆159Updated 5 years ago
mgrankin / over9000
Over9000 optimizer
☆426Updated 2 years ago
NVIDIA / runx
Deep Learning Experiment Management
☆640Updated 2 years ago
miguelvr / dropblock
Implementation of DropBlock: A regularization method for convolutional networks in PyTorch.
☆596Updated 5 years ago
vandit15 / Class-balanced-loss-pytorch
Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"
☆800Updated last year
joe-siyuan-qiao / WeightStandardization
Standardizing weights to accelerate micro-batch training
☆549Updated 3 years ago
lessw2020 / Ranger-Deep-Learning-Optimizer
Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase
☆1,202Updated last year
sseung0703 / KD_methods_with_TF
Knowledge distillation methods implemented with Tensorflow (now there are 11 (+1) methods, and will be added more.)
☆265Updated 5 years ago
zasdfgbnm / TorchSnooper
Debug PyTorch code using PySnooper
☆797Updated 4 years ago
ildoonet / pytorch-gradual-warmup-lr
Gradually-Warmup Learning Rate Scheduler for PyTorch
☆991Updated 9 months ago
YirongMao / softmax_variants
PyTorch code for softmax variants: center loss, cosface loss, large-margin gaussian mixture, COCOLoss, ring loss
☆255Updated 7 years ago
cybertronai / pytorch-lamb
Implementation of https://arxiv.org/abs/1904.00962
☆376Updated 4 years ago
PhilJd / contiguous_pytorch_params
Accelerate training by storing parameters in one contiguous chunk of memory.
☆290Updated 4 years ago
prigoyal / pytorch_memonger
Experimental ground for optimizing memory of pytorch models
☆366Updated 7 years ago
yangkky / distributed_tutorial
☆261Updated 5 years ago
achaiah / pywick
High-level batteries-included neural network training library for Pytorch
☆402Updated 3 years ago
lxtGH / OctaveConv_pytorch
Pytorch implementation of newly added convolution
☆586Updated 5 years ago
Lyken17 / pytorch-memonger
Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174
☆599Updated 5 years ago