juntang-zhuang / Adabelief-Optimizer
Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"
☆1,059Updated 6 months ago
Alternatives and similar repositories for Adabelief-Optimizer:
Users that are interested in Adabelief-Optimizer are comparing it to the libraries listed below
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,196Updated last year
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,534Updated 4 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,082Updated 10 months ago
- On the Variance of the Adaptive Learning Rate and Beyond☆2,542Updated 3 years ago
- Source code for "On the Relationship between Self-Attention and Convolutional Layers"☆1,096Updated 2 years ago
- A New Optimization Technique for Deep Neural Networks☆535Updated 3 years ago
- Reformer, the efficient Transformer, in Pytorch☆2,152Updated last year
- SAM: Sharpness-Aware Minimization (PyTorch)☆1,830Updated last year
- Ranger deep learning optimizer rewrite to use newest components☆327Updated last year
- [NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang☆1,661Updated 2 years ago
- Over9000 optimizer☆426Updated 2 years ago
- Code for Noisy Student Training. https://arxiv.org/abs/1911.04252☆756Updated 3 years ago
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,715Updated 2 weeks ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,115Updated 3 years ago
- Learning Rate Warmup in PyTorch☆403Updated 2 weeks ago
- Fast Block Sparse Matrices for Pytorch☆546Updated 4 years ago
- Deep Learning Experiment Management☆639Updated 2 years ago
- A pytorch port of google-research/google-research/robust_loss/☆672Updated 3 years ago
- NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Find explanation at tourdeml.github.io/blog/☆345Updated last year
- Shape and dimension inference (Keras-like) for PyTorch layers and neural networks☆570Updated 2 years ago
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆986Updated 4 months ago
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks☆474Updated 2 years ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆685Updated 4 years ago
- High-level batteries-included neural network training library for Pytorch☆398Updated 3 years ago
- higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr…☆1,607Updated 2 years ago
- Implementation of https://arxiv.org/abs/1904.00962☆371Updated 4 years ago
- Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS☆1,568Updated 8 months ago
- Standalone TFRecord reader/writer with PyTorch data loaders☆874Updated 6 months ago
- pip install antialiased-cnns to improve stability and accuracy☆1,661Updated 10 months ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆408Updated 6 months ago