taki0112 / AdaBound-Tensorflow
Simple Tensorflow implementation of "Adaptive Gradient Methods with Dynamic Bound of Learning Rate" (ICLR 2019)
☆150Updated 5 years ago
Alternatives and similar repositories for AdaBound-Tensorflow:
Users that are interested in AdaBound-Tensorflow are comparing it to the libraries listed below
- Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"☆97Updated 4 years ago
- A memory efficient implementation of densenet.☆82Updated 5 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 5 years ago
- Keras implementation of AdaBound☆130Updated 5 years ago
- Unofficial implementation of Octave Convolutions (OctConv) in TensorFlow / Keras.☆100Updated 5 years ago
- Using the CLR algorithm for training (https://arxiv.org/abs/1506.01186)☆108Updated 6 years ago
- lookahead optimizer for keras☆170Updated 5 years ago
- Corrupted labels and label smoothing☆128Updated 7 years ago
- A TensorFlow implementation of Group Normalization on the task of image classification☆208Updated 6 years ago
- TensorFlow implementation of PNASNet-5 on ImageNet☆101Updated 6 years ago
- repo that holds code for improving on dropout using Stochastic Delta Rule☆142Updated 6 years ago
- Implementation of OctConv in Pytorch (https://arxiv.org/abs/1904.05049)☆213Updated 5 years ago
- Octave convolution☆34Updated 3 years ago
- Implementations of ideas from recent papers☆391Updated 4 years ago
- Keras implementation of CoordConv for all Convolution layers☆147Updated 2 years ago
- keras implementation of AdamW from Fixing Weight Decay Regularization in Adam (https://arxiv.org/abs/1711.05101)☆70Updated 6 years ago
- [ECCV 2018] Sparsely Aggreagated Convolutional Networks https://arxiv.org/abs/1801.05895☆124Updated 6 years ago
- Models and examples built with Chainer☆119Updated 2 years ago
- An optimizer that trains as fast as Adam and as good as SGD in Tensorflow☆45Updated 5 years ago
- AdamW optimizer for Keras☆114Updated 5 years ago
- Code for reproducing results of the paper "Layer rotation: a surprisingly powerful indicator of generalization in deep networks?"☆50Updated 5 years ago
- A set of simple examples ported from PyTorch for Tensorflow Eager Execution☆73Updated 6 years ago
- Efficient Data Loading Pipeline in Pure Python☆211Updated 4 years ago
- GAN-QP: A Novel GAN Framework without Gradient Vanishing and Lipschitz Constraint☆119Updated 5 years ago
- PyTorch implementation of PNASNet-5 on ImageNet☆317Updated 2 years ago
- An implementation of "mixup: Beyond Empirical Risk Minimization"☆285Updated 7 years ago
- RAdam implemented in Keras & TensorFlow☆325Updated 3 years ago
- Simple Tensorflow implementation of "Partial Convolution based Padding" (partialconv)☆91Updated 6 years ago
- Keras implementation of Octave Convolutions☆53Updated 5 years ago
- Mish Deep Learning Activation Function for PyTorch / FastAI☆161Updated 4 years ago