egg-west / AdamW-pytorchLinks
Implementation and experiments for AdamW on Pytorch
☆94Updated 5 years ago
Alternatives and similar repositories for AdamW-pytorch
Users that are interested in AdamW-pytorch are comparing it to the libraries listed below
Sorting:
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 5 years ago
- Implementation of Octave Convolution from Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convol…☆57Updated 6 years ago
- A PyTorch implementation of shake-shake☆111Updated 5 years ago
- Implements https://arxiv.org/abs/1711.05101 AdamW optimizer, cosine learning rate scheduler and "Cyclical Learning Rates for Training Neu…☆152Updated 6 years ago
- Utilities for Pytorch☆88Updated 3 years ago
- [ICLR'19] Complement Objective Training☆75Updated 6 years ago
- pytorch implement of Lookahead Optimizer☆195Updated 3 years ago
- Random miniprojects with pytorch.☆170Updated 7 years ago
- Using the CLR algorithm for training (https://arxiv.org/abs/1506.01186)☆108Updated 7 years ago
- Implementation of OctConv in Pytorch (https://arxiv.org/abs/1904.05049)☆214Updated 6 years ago
- Mish Deep Learning Activation Function for PyTorch / FastAI☆161Updated 5 years ago
- lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch☆337Updated 6 years ago
- Improving Generalization via Scalable Neighborhood Component Analysis☆137Updated 2 years ago
- homura is a library for fast prototyping DL research☆106Updated 3 years ago
- Minimal API for receptive field calculation in PyTorch☆68Updated 3 years ago
- PyTorch implementation of shake-shake regularization☆48Updated 5 years ago
- A PyTorch implementation of " EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks."☆313Updated 5 years ago
- ☆182Updated 2 years ago
- An official collection of code in different frameworks that reproduces experiments in "Group Normalization"☆117Updated 4 years ago
- A TensorFlow re-implementation of Momentum Contrast (MoCo): https://arxiv.org/abs/1911.05722☆159Updated 2 years ago
- Full implementation of the paper "Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator".☆101Updated 5 years ago
- Code for reproducing results of the paper "Layer rotation: a surprisingly powerful indicator of generalization in deep networks?"☆50Updated 6 years ago
- Filter Response Normalization Layer in PyTorch☆121Updated 5 years ago
- A large scale study of Knowledge Distillation.☆219Updated 5 years ago
- Implementations of ideas from recent papers☆392Updated 4 years ago
- Simple Tensorflow implementation of "Adaptive Gradient Methods with Dynamic Bound of Learning Rate" (ICLR 2019)☆150Updated 6 years ago
- Pytorch implementation of group normalization in https://arxiv.org/abs/1803.08494 (Following the PyTorch Style)☆87Updated 6 years ago
- Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons (AAAI 2019)☆105Updated 6 years ago
- A simpler version of the self-attention layer from SAGAN, and some image classification results.☆214Updated 6 years ago
- ☆169Updated 4 years ago