Decoupled Weight Decay Regularization (ICLR 2019)
☆292Jan 9, 2019Updated 7 years ago
Alternatives and similar repositories for AdamW-and-SGDW
Users that are interested in AdamW-and-SGDW are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆254Nov 23, 2016Updated 9 years ago
- keras implementation of AdamW from Fixing Weight Decay Regularization in Adam (https://arxiv.org/abs/1711.05101)☆71Jul 23, 2018Updated 7 years ago
- Experiments with Adam/AdamW/amsgrad☆201Sep 5, 2018Updated 7 years ago
- 2.86% and 15.85% on CIFAR-10 and CIFAR-100☆297Oct 9, 2018Updated 7 years ago
- Code for "Aggregated Momentum: Stability Through Passive Damping", Lucas et al. 2018☆35Nov 6, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Apr 13, 2023Updated 2 years ago
- AdamW optimizer for Keras☆116Aug 9, 2019Updated 6 years ago
- ☆22Nov 24, 2018Updated 7 years ago
- On the Variance of the Adaptive Learning Rate and Beyond☆2,549Jul 31, 2021Updated 4 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆182Nov 21, 2021Updated 4 years ago
- Sampled Quasi-Newton Methods for Deep Learning☆21May 21, 2020Updated 5 years ago
- TensorFlow implementation of (Momentum) Stochastic Variance-Adapted Gradient.☆44May 11, 2018Updated 7 years ago
- Code for visualizing the loss landscape of neural nets☆3,161Apr 5, 2022Updated 3 years ago
- i-RevNet Pytorch Code☆397Feb 16, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Unsupervised instance segmentation via active robot interaction☆76Jul 1, 2022Updated 3 years ago
- Implementation and experiments for AdamW on Pytorch☆94Nov 23, 2019Updated 6 years ago
- ☆218May 23, 2018Updated 7 years ago
- ☆135Oct 23, 2017Updated 8 years ago
- Lua implementation of Entropy-SGD☆81Apr 9, 2018Updated 7 years ago
- Full implementation of the paper "Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator".☆101Mar 9, 2020Updated 6 years ago
- Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"☆920Aug 27, 2019Updated 6 years ago
- Implementation of Adversarial Variational Optimization in PyTorch☆42Aug 7, 2018Updated 7 years ago
- Small scale experiments with group normalization☆57Apr 4, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This is a clean implementation of AM-GANs, supporting our paper “Activation Maximization Generative Adversarial Nets”.☆16Sep 26, 2020Updated 5 years ago
- Code for Stochastic Hyperparameter Optimization through Hypernetworks☆28Jun 11, 2018Updated 7 years ago
- Forward-mode Automatic Differentiation for TensorFlow☆139Mar 12, 2018Updated 8 years ago
- CondenseNet: Light weighted CNN for mobile devices☆691Nov 11, 2019Updated 6 years ago
- Code for "Training Generative Adversarial Networks with Binary Neurons by End-to-end Backpropagation"☆26Oct 30, 2019Updated 6 years ago
- Neural Architecture Search with Bayesian Optimisation and Optimal Transport☆136Jan 28, 2019Updated 7 years ago
- An optimizer that trains as fast as Adam and as good as SGD.☆2,907Jul 23, 2023Updated 2 years ago
- A PyTorch implementation of shake-shake☆112Apr 21, 2020Updated 5 years ago
- Pytorch implementation of MaxPoolingLoss.☆177Jun 9, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for Switchable Normalization from "Differentiable Learning-to-Normalize via Switchable Normalization", https://arxiv.org/abs/1806.10…☆869Jun 11, 2020Updated 5 years ago
- customized GPflow with simple Tensorflow API☆17Aug 7, 2019Updated 6 years ago
- LSTM and QRNN Language Model Toolkit for PyTorch☆1,990Feb 12, 2022Updated 4 years ago
- Hybrid Discriminative-Generative Training via Contrastive Learning☆75May 1, 2023Updated 2 years ago
- PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM☆1,264Feb 12, 2022Updated 4 years ago
- Stochastic Weight Averaging in PyTorch☆977Aug 1, 2021Updated 4 years ago
- Code for "Differentiable Compositional Kernel Learning for Gaussian Processes" https://arxiv.org/abs/1806.04326☆71Jul 10, 2018Updated 7 years ago