taki0112 / RAdam-TensorflowLinks

Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"

☆97

Alternatives and similar repositories for RAdam-Tensorflow

Users that are interested in RAdam-Tensorflow are comparing it to the libraries listed below

Sorting:

taki0112 / AdaBound-Tensorflow
Simple Tensorflow implementation of "Adaptive Gradient Methods with Dynamic Bound of Learning Rate" (ICLR 2019)
☆150Updated 6 years ago
Kyubyong / label_smoothing
Corrupted labels and label smoothing
☆129Updated 7 years ago
titu1994 / keras_rectified_adam
Implementation of Rectified Adam in Keras
☆70Updated 5 years ago
kozistr / AdaBound-tensorflow
An optimizer that trains as fast as Adam and as good as SGD in Tensorflow
☆45Updated 6 years ago
sdoria / SimpleSelfAttention
A simpler version of the self-attention layer from SAGAN, and some image classification results.
☆212Updated 5 years ago
titu1994 / keras-LAMB-Optimizer
Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"
☆75Updated 6 years ago
pgmmpk / tfrecord
Python way to Read/Write TFRecords
☆64Updated 7 years ago
taki0112 / partial_conv-Tensorflow
Simple Tensorflow implementation of "Partial Convolution based Padding" (partialconv)
☆91Updated 6 years ago
AlexiaJM / MaximumMarginGANs
Code for paper: "Support Vector Machines, Wasserstein's distance and gradient-penalty GANs maximize a margin"
☆178Updated 5 years ago
shaohua0116 / WGAN-GP-TensorFlow
TensorFlow implementations of Wasserstein GAN with Gradient Penalty (WGAN-GP), Least Squares GAN (LSGAN), GANs with the hinge loss.
☆44Updated 6 years ago
tunz / tcop-pytorch
tunz's CUDA pytorch operator (MaskedSoftmax)
☆75Updated 6 years ago
titu1994 / tf-eager-examples
A set of simple examples ported from PyTorch for Tensorflow Eager Execution
☆73Updated 7 years ago
titu1994 / keras-padam
Keras implementation of Padam from "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks"
☆17Updated 6 years ago
GLambard / AdamW_Keras
AdamW optimizer for Keras
☆115Updated 5 years ago
jalola / compare-tensorflow-pytorch
Compare outputs between layers written in Tensorflow and layers written in Pytorch
☆72Updated 7 years ago
graykode / modelsummary
All Model summary in PyTorch similar to `model.summary()` in Keras
☆88Updated 6 years ago
sgugger / Adam-experiments
Experiments with Adam/AdamW/amsgrad
☆201Updated 6 years ago
OverLordGoldDragon / keras-adamw
Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers
☆168Updated 3 years ago
titu1994 / keras-octconv
Keras implementation of Octave Convolutions
☆53Updated 6 years ago
taki0112 / SphereGAN-Tensorflow
Simple Tensorflow implementation of SphereGAN (CVPR 2019 Oral)
☆56Updated 5 years ago
ildoonet / remote-dataloader
PyTorch DataLoader processed in multiple remote computation machines for heavy data processings
☆67Updated 5 years ago
leonardblier / alrao
Implementation of "Learning with Random Learning Rates" in PyTorch.
☆102Updated 5 years ago
CyberZHG / keras-lookahead
Lookahead mechanism for optimizers in Keras.
☆50Updated 4 years ago
taki0112 / AMSGrad-Tensorflow
Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)
☆104Updated 6 years ago
kristpapadopoulos / keras-stochastic-weight-averaging
Keras callback function for stochastic weight averaging
☆56Updated 3 years ago
lessw2020 / mish
Mish Deep Learning Activation Function for PyTorch / FastAI
☆161Updated 5 years ago
golbin / TensorFlow-Multi-GPUs
Samples for Multi GPUs in TensorFlow
☆83Updated 7 years ago
vinojjayasundara / textcaps
Official Implementation of "Textcaps: Handwritten Character Recognition With Very Small Datasets" (WACV 2019).
☆136Updated 5 years ago
kakaobrain / autoclint
A specially designed light version of Fast AutoAugment
☆171Updated 5 years ago
ducha-aiki / google-retrieval-challenge-2019-fastai-starter
fast.ai starter kit for Google Landmark Retrieval 2019 challenge
☆61Updated 6 years ago