YanaiEliyahu / AdasOptimizerLinks
ADAS is short for Adaptive Step Size, it's an optimizer that unlike other optimizers that just normalize the derivative, it fine-tunes the step size, truly making step size scheduling obsolete, achieving state-of-the-art training performance
☆85Updated 5 years ago
Alternatives and similar repositories for AdasOptimizer
Users that are interested in AdasOptimizer are comparing it to the libraries listed below
Sorting:
- ☆77Updated last year
- a mini Deep Learning framework supporting GPU accelerations written with CUDA☆32Updated 5 years ago
- graftr: an interactive shell to view and edit PyTorch checkpoints.☆114Updated 5 years ago
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.☆88Updated 4 years ago
- Lite Inference Toolkit (LIT) for PyTorch☆160Updated 4 years ago
- Official code for the Stochastic Polyak step-size optimizer☆139Updated last year
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.☆86Updated 4 years ago
- Deep Learning project template best practices with Pytorch Lightning, Hydra, Tensorboard.☆160Updated 4 years ago
- Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".…☆30Updated 4 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆15Updated 3 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆218Updated 4 years ago
- Learning to Initialize Neural Networks for Stable and Efficient Training☆138Updated 3 years ago
- AutoML for image augmentation. AutoAlbument uses the Faster AutoAugment algorithm to find optimal augmentation policies. Documentation - …☆207Updated 4 years ago
- Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".☆158Updated 4 years ago
- Implementation of Feedback Transformer in Pytorch☆108Updated 4 years ago
- PyTorch dataset extended with map, cache etc. (tensorflow.data like)☆331Updated 3 years ago
- Unofficial PyTorch Implementation of EvoNorm☆123Updated 4 years ago
- Implementation of Fast Transformer in Pytorch☆177Updated 4 years ago
- Light Face Detection using PyTorch Lightning☆83Updated 2 years ago
- a lightweight transformer library for PyTorch☆72Updated 4 years ago
- Knowledge Distillation Toolkit☆89Updated 5 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆129Updated 2 years ago
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆252Updated 3 years ago
- ☆54Updated 5 years ago
- Aloception is a set of package for computer vision: aloscene, alodataset, alonet.☆93Updated 8 months ago
- ☆47Updated 5 years ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Code for scaling Transformers☆26Updated 5 years ago
- Model for document segmentation trained on the midv-500-models dataset.☆78Updated 5 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆184Updated 4 years ago