Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers
☆169 · Jan 6, 2022 · Updated 4 years ago
Alternatives and similar repositories for keras-adamw
Users that are interested in keras-adamw are comparing it to the libraries listed below.
- AdamW optimizer for Keras ☆116 · Aug 9, 2019 · Updated 6 years ago
- Keras implementation of Padam from "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks" ☆17 · Sep 6, 2018 · Updated 7 years ago
- Gradient accumulation for Keras ☆35 · Jun 27, 2021 · Updated 4 years ago
- RAdam implemented in Keras & TensorFlow ☆324 · Jan 22, 2022 · Updated 4 years ago
- keras implementation of AdamW from Fixing Weight Decay Regularization in Adam (https://arxiv.org/abs/1711.05101) ☆71 · Jul 23, 2018 · Updated 7 years ago
- Python library for various computer vision problems with a focus on easy usage. ☆18 · Jan 25, 2021 · Updated 5 years ago
- Learning rate multiplier ☆46 · Jun 22, 2021 · Updated 4 years ago
- Keras implementation of AdaBound ☆130 · Nov 4, 2019 · Updated 6 years ago
- Keras implementation of Cosine Annealing Scheduler ☆43 · Apr 6, 2020 · Updated 6 years ago
- Implementation of EfficientNet model. Keras and TensorFlow Keras. ☆2,105 · Jan 24, 2024 · Updated 2 years ago
- SNAIL Attention Block for Keras. ☆17 · Mar 30, 2020 · Updated 6 years ago
- A Tensorflow 2.0 implementation of TabNet. ☆245 · Apr 27, 2023 · Updated 3 years ago
- Plots the change of the loss function of a Keras model when the learning rate is exponentially increasing. ☆258 · May 27, 2025 · Updated last year
- Keras implementation of Global Context Attention blocks ☆46 · Apr 29, 2019 · Updated 7 years ago
- GUI for albumentations library ☆11 · Sep 13, 2019 · Updated 6 years ago
- Lambda Networks implemented in PyTorch ☆13 · Feb 22, 2021 · Updated 5 years ago
- Full knowledge and control of the train state. ☆19 · Sep 23, 2020 · Updated 5 years ago
- ☆1,207 · Jun 5, 2020 · Updated 5 years ago
- A reusable implementation of EfficientNet in TensorFlow 2.0 and Keras ☆17 · Feb 11, 2022 · Updated 4 years ago
- Keras implementation of Octave Convolutions ☆52 · Apr 23, 2019 · Updated 7 years ago
- Implementation of One-Cycle Learning rate policy (adapted from Fast.ai lib) ☆289 · Jun 30, 2020 · Updated 5 years ago
- lookahead optimizer for keras ☆168 · Oct 14, 2019 · Updated 6 years ago
- diffGrad: An Optimization Method for Convolutional Neural Networks ☆55 · Oct 12, 2022 · Updated 3 years ago
- `junior must know his place` team solution ☆10 · Aug 15, 2023 · Updated 2 years ago
- Tensorflow port implementation of Single Headed Attention RNN ☆16 · Feb 1, 2020 · Updated 6 years ago
- ☆10 · Nov 30, 2022 · Updated 3 years ago
- A Hyperparameter Tuning Library for Keras ☆2,924 · Dec 1, 2025 · Updated 5 months ago
- Semantic segmentation pipeline using Catalyst. ☆20 · Apr 3, 2020 · Updated 6 years ago
- Implementation of Rectified Adam in Keras ☆70 · Aug 24, 2019 · Updated 6 years ago
- Implementation of Squeeze and Excitation Networks in Keras ☆401 · Mar 10, 2020 · Updated 6 years ago
- Lookahead mechanism for optimizers in Keras. ☆50 · Jun 24, 2021 · Updated 4 years ago
- RAdam optimizer for keras ☆71 · Oct 14, 2019 · Updated 6 years ago
- Layer-wise Sparsification of Distributed Deep Learning ☆10 · Jul 6, 2020 · Updated 5 years ago
- Transformer-XL with checkpoint loader ☆67 · Jan 22, 2022 · Updated 4 years ago
- wrapping a keras optimizer to implement gradient accumulation ☆118 · Aug 29, 2020 · Updated 5 years ago
- Attention mechanism for processing sequential data that considers the context for each timestamp. ☆657 · Jan 22, 2022 · Updated 4 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable ☆218 · Apr 4, 2021 · Updated 5 years ago
- Hash Embedding code for the paper "Hash Embeddings for Efficient Word Representations" ☆42 · Dec 15, 2017 · Updated 8 years ago
- Simple gradient checkpointing for eager mode execution ☆46 · Dec 12, 2020 · Updated 5 years ago