egg-west / AdamW-pytorch
Implementation and experiments for AdamW on Pytorch
☆93Updated 5 years ago
Alternatives and similar repositories for AdamW-pytorch:
Users that are interested in AdamW-pytorch are comparing it to the libraries listed below
- Random miniprojects with pytorch.☆173Updated 6 years ago
- A PyTorch implementation of shake-shake☆111Updated 4 years ago
- Utilities for Pytorch☆89Updated 2 years ago
- Implementation of Octave Convolution from Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convol…☆56Updated 5 years ago
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 4 years ago
- Bridging the gap Between Stability and Scalability in Neural Architecture Search☆141Updated 3 years ago
- Mish Deep Learning Activation Function for PyTorch / FastAI☆161Updated 4 years ago
- ICLR 2018 reproducibility challenge - Multi-Scale Dense Convolutional Networks for Efficient Prediction☆137Updated 6 years ago
- Implementation of soft parameter sharing for neural networks☆69Updated 4 years ago
- Code for NeurIPS 2019 Paper, "L_DMI: An Information-theoretic Noise-robust Loss Function"☆118Updated last year
- PyTorch code for softmax variants: center loss, cosface loss, large-margin gaussian mixture, COCOLoss, ring loss☆254Updated 6 years ago
- Implementation of DropBlock in Pytorch☆81Updated 6 years ago
- Tensorflow code for Differentiable architecture search☆73Updated 6 years ago
- ☆182Updated last year
- A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch☆123Updated 7 years ago
- Filter Response Normalization Layer in PyTorch☆121Updated 4 years ago
- lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch☆334Updated 5 years ago
- [ICLR'19] Complement Objective Training☆76Updated 6 years ago
- Pytorch implementation of SNAS☆75Updated 6 years ago
- Minimal API for receptive field calculation in PyTorch☆67Updated 2 years ago
- Code for reproducing results of the paper "Layer rotation: a surprisingly powerful indicator of generalization in deep networks?"☆50Updated 5 years ago
- pytorch implement of Lookahead Optimizer☆188Updated 2 years ago
- Implements https://arxiv.org/abs/1711.05101 AdamW optimizer, cosine learning rate scheduler and "Cyclical Learning Rates for Training Neu…☆149Updated 5 years ago
- Code for https://arxiv.org/abs/1810.04622☆140Updated 5 years ago
- Implementations of ideas from recent papers☆391Updated 4 years ago
- Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons (AAAI 2019)☆105Updated 5 years ago
- Code release for paper "Random Search and Reproducibility for NAS"☆167Updated 5 years ago
- Improving Generalization via Scalable Neighborhood Component Analysis☆136Updated last year
- Why ReLU networks yield high-confidence predictions far away from the training data and how to mitigate the problem [CVPR 2019, oral]☆182Updated 5 years ago
- Improving Consistency-Based Semi-Supervised Learning with Weight Averaging☆185Updated 5 years ago