taki0112 / AdaBound-Tensorflow
Simple TensorFlow implementation of "Adaptive Gradient Methods with Dynamic Bound of Learning Rate" (ICLR 2019)
☆149 · Updated 5 years ago
Related projects:
- Simple TensorFlow implementation of "On the Variance of the Adaptive Learning Rate and Beyond" ☆97 · Updated 4 years ago
- Using the cyclical learning rate (CLR) algorithm for training (https://arxiv.org/abs/1506.01186) ☆108 · Updated 6 years ago
- Keras implementation of AdaBound ☆130 · Updated 4 years ago
- A TensorFlow implementation of Group Normalization on the task of image classification ☆208 · Updated 5 years ago
- Implementation of OctConv in PyTorch (https://arxiv.org/abs/1904.05049) ☆213 · Updated 5 years ago
- Simple TensorFlow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018) ☆104 · Updated 5 years ago
- Repository holding code for improving on dropout using the Stochastic Delta Rule ☆142 · Updated 5 years ago
- A memory-efficient implementation of DenseNet. ☆82 · Updated 4 years ago
- Unofficial implementation of Octave Convolutions (OctConv) in TensorFlow / Keras. ☆100 · Updated 4 years ago
- PyTorch implementation of PNASNet-5 on ImageNet ☆315 · Updated 2 years ago
- Lookahead optimizer for Keras ☆170 · Updated 4 years ago
- Models and examples built with Chainer ☆118 · Updated 2 years ago
- Corrupted labels and label smoothing ☆128 · Updated 6 years ago
- Implementation and experiments for AdamW in PyTorch ☆93 · Updated 4 years ago
- Knowledge distillation using TensorFlow ☆141 · Updated 5 years ago
- An implementation of "mixup: Beyond Empirical Risk Minimization" ☆283 · Updated 6 years ago
- Implementations of ideas from recent papers ☆391 · Updated 3 years ago
- ☆169 · Updated 3 years ago
- Model compression CLI tool for Keras. ☆157 · Updated 5 years ago
- TensorFlow implementation of PNASNet-5 on ImageNet ☆102 · Updated 5 years ago
- Label Refinery: Improving ImageNet Classification through Label Progression ☆280 · Updated 6 years ago
- TensorFlow code for differentiable architecture search ☆73 · Updated 5 years ago
- PyTorch implementation of "Perturbative Neural Networks" (CVPR 2018), http://xujuefei.com/pnn.html ☆138 · Updated 5 years ago
- Code for reproducing results of the paper "Layer rotation: a surprisingly powerful indicator of generalization in deep networks?" ☆50 · Updated 5 years ago
- Random miniprojects with PyTorch. ☆173 · Updated 5 years ago
- Mish deep learning activation function for PyTorch / FastAI ☆160 · Updated 4 years ago
- Knowledge distillation methods implemented with TensorFlow (currently 11 (+1) methods, with more to be added) ☆265 · Updated 4 years ago
- A specially designed light version of Fast AutoAugment ☆170 · Updated 4 years ago
- Complementary code for the Targeted Dropout paper ☆257 · Updated 4 years ago
- Octave convolution ☆34 · Updated 2 years ago