YanaiEliyahu / AdasOptimizer
ADAS is short for Adaptive Step Size, it's an optimizer that unlike other optimizers that just normalize the derivative, it fine-tunes the step size, truly making step size scheduling obsolete, achieving state-of-the-art training performance
☆85Updated 4 years ago
Alternatives and similar repositories for AdasOptimizer:
Users that are interested in AdasOptimizer are comparing it to the libraries listed below
- ☆77Updated 8 months ago
- Auto-Magical Deploy AI model at large scale, high performance, and easy to use☆66Updated last year
- Light Face Detection using PyTorch Lightning☆84Updated last year
- Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".…☆30Updated 4 years ago
- Deep Learning project template best practices with Pytorch Lightning, Hydra, Tensorboard.☆156Updated 3 years ago
- a lightweight transformer library for PyTorch☆71Updated 3 years ago
- ☆52Updated 4 years ago
- Catalyst.Segmentation☆28Updated 3 years ago
- Implementation of modern data augmentation techniques in TensorFlow 2.x to be used in your training pipeline.☆34Updated 4 years ago
- graftr: an interactive shell to view and edit PyTorch checkpoints.☆111Updated 4 years ago
- Implementation of Feedback Transformer in Pytorch☆105Updated 4 years ago
- Simple stochastic weight averaging callback for Keras☆63Updated 3 years ago
- ☆14Updated 3 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆212Updated 3 years ago
- Implementation of Online Label Smoothing in PyTorch☆94Updated 2 years ago
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.☆83Updated 3 years ago
- Cyclemoid implementation for PyTorch☆87Updated 2 years ago
- Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2.☆60Updated 3 years ago
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.☆87Updated 3 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆127Updated last year
- ☆47Updated 4 years ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆42Updated 7 months ago
- Knowledge Distillation Toolkit☆89Updated 4 years ago
- Self Driving Car for Digital Race contest that is sponsored by FPT Corp.☆24Updated 3 years ago
- The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends☆77Updated 3 months ago
- A collection of code snippets for my PyTorch Lightning projects☆107Updated 4 years ago
- Electra pre-trained model using Vietnamese corpus☆66Updated last year
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆251Updated 2 years ago
- Lite Inference Toolkit (LIT) for PyTorch☆161Updated 3 years ago
- Create SSH tunel to a running colab notebook☆67Updated 3 years ago