OverLordGoldDragon / keras-adamwLinks

Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers

☆168

Alternatives and similar repositories for keras-adamw

Users that are interested in keras-adamw are comparing it to the libraries listed below

Sorting:

GLambard / AdamW_Keras
AdamW optimizer for Keras
☆115Updated 6 years ago
titu1994 / keras-adabound
Keras implementation of AdaBound
☆130Updated 6 years ago
CyberZHG / keras-lookahead
Lookahead mechanism for optimizers in Keras.
☆50Updated 4 years ago
CyberZHG / keras-radam
RAdam implemented in Keras & TensorFlow
☆326Updated 3 years ago
CyberZHG / keras-lr-multiplier
Learning rate multiplier
☆46Updated 4 years ago
shaoanlu / AdamW-and-SGDW
keras implementation of AdamW from Fixing Weight Decay Regularization in Adam (https://arxiv.org/abs/1711.05101)
☆71Updated 7 years ago
kristpapadopoulos / keras-stochastic-weight-averaging
Keras callback function for stochastic weight averaging
☆56Updated 3 years ago
titu1994 / keras-one-cycle
Implementation of One-Cycle Learning rate policy (adapted from Fast.ai lib)
☆287Updated 5 years ago
google / bi-tempered-loss
Robust Bi-Tempered Logistic Loss Based on Bregman Divergences. https://arxiv.org/pdf/1906.03361.pdf
☆147Updated 3 years ago
titu1994 / keras_rectified_adam
Implementation of Rectified Adam in Keras
☆70Updated 6 years ago
CyberZHG / keras-gradient-accumulation
Gradient accumulation for Keras
☆35Updated 4 years ago
4uiiurz1 / keras-cosine-annealing
Keras implementation of Cosine Annealing Scheduler
☆44Updated 5 years ago
CyberZHG / keras-adabound
AdaBound optimizer in Keras
☆56Updated 5 years ago
sdoria / SimpleSelfAttention
A simpler version of the self-attention layer from SAGAN, and some image classification results.
☆214Updated 6 years ago
taki0112 / RAdam-Tensorflow
Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"
☆97Updated 5 years ago
netrack / keras-metrics
Metrics for Keras. DEPRECATED since Keras 2.3.0
☆163Updated 3 years ago
surmenok / keras_lr_finder
Plots the change of the loss function of a Keras model when the learning rate is exponentially increasing.
☆258Updated 5 months ago
simon-larsson / keras-swa
Simple stochastic weight averaging callback for Keras
☆63Updated 4 years ago
titu1994 / keras-efficientnets
Keras Implementation of EfficientNets
☆186Updated 5 years ago
AndreasMadsen / python-lrcurve
Creates a learning-curve plot for Jupyter/Colab notebooks that is updated in real-time.
☆177Updated 3 years ago
titu1994 / Snapshot-Ensembles
Snapshot Ensemble in Keras
☆311Updated 8 years ago
lessw2020 / Ranger-Mish-ImageWoof-5
Repo to build on / reproduce the record breaking Ranger-Mish-SelfAttention setup on FastAI ImageWoof dataset 5 epochs
☆116Updated 6 years ago
titu1994 / keras-attention-augmented-convs
Keras implementation of Attention Augmented Convolutional Neural Networks
☆121Updated 5 years ago
mgrankin / over9000
Over9000 optimizer
☆425Updated 2 years ago
ClementWalter / Keras-FewShotLearning
Some State-of-the-Art few shot learning algorithms in tensorflow 2
☆212Updated 2 years ago
jkoutsikakis / pytorch-wrapper
Provides a systematic and extensible way to build, train, evaluate, and tune deep learning models using PyTorch.
☆94Updated last year
Tony607 / Focal_Loss_Keras
Multi-class classification with focal loss for imbalanced datasets
☆82Updated 6 years ago
arthurdouillard / keras-snapshot_ensembles
Implementation in Keras of: Snapshot Ensembles: Train 1, get M for free (https://arxiv.org/abs/1704.00109)
☆26Updated 7 years ago
Vermeille / Torchelie
Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.
☆110Updated 2 months ago
maciej-sypetkowski / kaggle-rcic-1st
1st Place Solution for Kaggle Recursion Cellular Image Classification Challenge -- https://www.kaggle.com/c/recursion-cellular-image-clas…
☆144Updated 6 years ago