CyberZHG/keras-radam

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CyberZHG/keras-radam)

CyberZHG / keras-radam

RAdam implemented in Keras & TensorFlow

☆324

Alternatives and similar repositories for keras-radam

Users that are interested in keras-radam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CyberZHG / keras-lookahead
View on GitHub
Lookahead mechanism for optimizers in Keras.
☆50Jun 24, 2021Updated 5 years ago
titu1994 / keras_rectified_adam
View on GitHub
Implementation of Rectified Adam in Keras
☆70Aug 24, 2019Updated 6 years ago
LiyuanLucasLiu / RAdam
View on GitHub
On the Variance of the Adaptive Learning Rate and Beyond
☆2,548Jul 31, 2021Updated 4 years ago
bojone / keras_lookahead
View on GitHub
lookahead optimizer for keras
☆168Oct 14, 2019Updated 6 years ago
CyberZHG / keras-gradient-accumulation
View on GitHub
Gradient accumulation for Keras
☆35Jun 27, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
OverLordGoldDragon / keras-adamw
View on GitHub
Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers
☆168Jan 6, 2022Updated 4 years ago
qubvel / efficientnet
View on GitHub
Implementation of EfficientNet model. Keras and TensorFlow Keras.
☆2,099Jan 24, 2024Updated 2 years ago
GLambard / AdamW_Keras
View on GitHub
AdamW optimizer for Keras
☆115Aug 9, 2019Updated 6 years ago
taki0112 / RAdam-Tensorflow
View on GitHub
Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"
☆97Apr 1, 2020Updated 6 years ago
bojone / keras_radam
View on GitHub
RAdam optimizer for keras
☆71Oct 14, 2019Updated 6 years ago
titu1994 / keras_mixnets
View on GitHub
Keras Implementation of MixNets: Mixed Depthwise Convolutions
☆40Mar 30, 2020Updated 6 years ago
CyberZHG / keras-transformer-xl
View on GitHub
Transformer-XL with checkpoint loader
☆67Jan 22, 2022Updated 4 years ago
lessw2020 / Ranger-Deep-Learning-Optimizer
View on GitHub
Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase
☆1,207Dec 22, 2023Updated 2 years ago
titu1994 / keras_novograd
View on GitHub
Keras implementation of NovoGrad
☆20Aug 21, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tensorflow / addons
View on GitHub
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons
☆1,700Sep 4, 2025Updated 10 months ago
mgrankin / over9000
View on GitHub
Over9000 optimizer
☆424Nov 22, 2022Updated 3 years ago
pronkinnikita / pytorch-pretrained-BERT
View on GitHub
📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…
☆16Jun 9, 2019Updated 7 years ago
ngxbac / Kaggle-Jigsaw
View on GitHub
☆23Jun 27, 2019Updated 7 years ago
CyberZHG / keras-adaptive-softmax
View on GitHub
Adaptive embedding and softmax
☆17Jan 22, 2022Updated 4 years ago
CyberZHG / keras-transformer
View on GitHub
Transformer implemented in Keras
☆369Jan 22, 2022Updated 4 years ago
CyberZHG / keras-lr-multiplier
View on GitHub
Learning rate multiplier
☆46Jun 22, 2021Updated 5 years ago
4uiiurz1 / keras-cosine-annealing
View on GitHub
Keras implementation of Cosine Annealing Scheduler
☆43Apr 6, 2020Updated 6 years ago
titu1994 / keras-efficientnets
View on GitHub
Keras Implementation of EfficientNets
☆183Mar 17, 2020Updated 6 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
SeanSdahl / RangerOptimizerTensorflow
View on GitHub
This repository provides a class which can be used for the optimization of a tf.keras model. It combines the two optimizers Lookahead and…
☆14Oct 26, 2024Updated last year
CyberZHG / keras-lamb
View on GitHub
Layer-wise Adaptive Moments optimizer for Batch training
☆15Apr 3, 2019Updated 7 years ago
keras-team / keras-tuner
View on GitHub
A Hyperparameter Tuning Library for Keras
☆2,923Dec 1, 2025Updated 7 months ago
bojone / accum_optimizer_for_keras
View on GitHub
wrapping a keras optimizer to implement gradient accumulation
☆118Aug 29, 2020Updated 5 years ago
Zelgunn / CustomKerasLayers
View on GitHub
ResBlock, DenseBlock and SpatialTransformer layers made with the Keras Layer API and TF2.0.
☆16Jan 20, 2022Updated 4 years ago
titu1994 / lambda_networks_pt
View on GitHub
Lambda Networks implemented in PyTorch
☆13Feb 22, 2021Updated 5 years ago
titu1994 / tf-TabNet
View on GitHub
A Tensorflow 2.0 implementation of TabNet.
☆245Apr 27, 2023Updated 3 years ago
juntang-zhuang / Adabelief-Optimizer
View on GitHub
Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"
☆1,071Aug 9, 2024Updated last year
kristpapadopoulos / keras-stochastic-weight-averaging
View on GitHub
Keras callback function for stochastic weight averaging
☆56Jun 11, 2022Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
shaoanlu / AdamW-and-SGDW
View on GitHub
keras implementation of AdamW from Fixing Weight Decay Regularization in Adam (https://arxiv.org/abs/1711.05101)
☆71Jul 23, 2018Updated 8 years ago
daigo0927 / tf-simple-metric-learning
View on GitHub
Simple metric learning methods via tf.keras
☆19Aug 25, 2020Updated 5 years ago
digantamisra98 / Mish
View on GitHub
Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]
☆1,300Jul 20, 2026Updated last week
yu4u / mixup-generator
View on GitHub
An implementation of "mixup: Beyond Empirical Risk Minimization"
☆286Nov 5, 2017Updated 8 years ago
titu1994 / keras-attention-augmented-convs
View on GitHub
Keras implementation of Attention Augmented Convolutional Neural Networks
☆120Mar 6, 2020Updated 6 years ago
Separius / BERT-keras
View on GitHub
Keras implementation of BERT with pre-trained weights
☆813Jul 26, 2019Updated 7 years ago
kpot / keras-transformer
View on GitHub
Keras library for building (Universal) Transformers, facilitating BERT and GPT models
☆540May 30, 2020Updated 6 years ago