Decoupled Weight Decay Regularization (ICLR 2019)
☆296 · Jan 9, 2019 · Updated 7 years ago
Alternatives and similar repositories for AdamW-and-SGDW
Users that are interested in AdamW-and-SGDW are comparing it to the libraries listed below.
- ☆256 · Nov 23, 2016 · Updated 9 years ago
- Keras implementation of AdamW from Fixing Weight Decay Regularization in Adam (https://arxiv.org/abs/1711.05101) ☆71 · Jul 23, 2018 · Updated 7 years ago
- Experiments with Adam/AdamW/AMSGrad ☆201 · Sep 5, 2018 · Updated 7 years ago
- 2.86% and 15.85% on CIFAR-10 and CIFAR-100 ☆296 · Oct 9, 2018 · Updated 7 years ago
- Code for "Aggregated Momentum: Stability Through Passive Damping", Lucas et al. 2018 ☆36 · Nov 6, 2018 · Updated 7 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep … ☆40 · Apr 13, 2023 · Updated 3 years ago
- AdamW optimizer for Keras ☆116 · Aug 9, 2019 · Updated 6 years ago
- ☆22 · Nov 24, 2018 · Updated 7 years ago
- On the Variance of the Adaptive Learning Rate and Beyond ☆2,550 · Jul 31, 2021 · Updated 4 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization ☆181 · Nov 21, 2021 · Updated 4 years ago
- Sampled Quasi-Newton Methods for Deep Learning ☆21 · May 21, 2020 · Updated 6 years ago
- TensorFlow implementation of (Momentum) Stochastic Variance-Adapted Gradient. ☆45 · May 11, 2018 · Updated 8 years ago
- Code for visualizing the loss landscape of neural nets ☆3,181 · Apr 5, 2022 · Updated 4 years ago
- i-RevNet PyTorch Code ☆396 · Feb 16, 2021 · Updated 5 years ago
- Unsupervised instance segmentation via active robot interaction ☆76 · Jul 1, 2022 · Updated 3 years ago
- Implementation and experiments for AdamW in PyTorch ☆94 · Nov 23, 2019 · Updated 6 years ago
- ☆218 · May 23, 2018 · Updated 8 years ago
- ☆137 · Oct 23, 2017 · Updated 8 years ago
- Lua implementation of Entropy-SGD ☆81 · Apr 9, 2018 · Updated 8 years ago
- Full implementation of the paper "Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator". ☆101 · Mar 9, 2020 · Updated 6 years ago
- Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)" ☆920 · Aug 27, 2019 · Updated 6 years ago
- Implementation of Adversarial Variational Optimization in PyTorch ☆42 · Aug 7, 2018 · Updated 7 years ago
- Small scale experiments with group normalization ☆58 · Apr 4, 2018 · Updated 8 years ago
- A clean implementation of AM-GANs. ☆16 · Sep 26, 2020 · Updated 5 years ago
- Code for Stochastic Hyperparameter Optimization through Hypernetworks ☆28 · Jun 11, 2018 · Updated 7 years ago
- Forward-mode Automatic Differentiation for TensorFlow ☆139 · Mar 12, 2018 · Updated 8 years ago
- CondenseNet: Lightweight CNN for mobile devices ☆691 · Nov 11, 2019 · Updated 6 years ago
- Code for "Training Generative Adversarial Networks with Binary Neurons by End-to-end Backpropagation" ☆26 · Oct 30, 2019 · Updated 6 years ago
- Neural Architecture Search with Bayesian Optimisation and Optimal Transport ☆136 · Jan 28, 2019 · Updated 7 years ago
- An optimizer that trains as fast as Adam and as good as SGD. ☆2,905 · Jul 23, 2023 · Updated 2 years ago
- A PyTorch implementation of shake-shake ☆112 · Apr 21, 2020 · Updated 6 years ago
- PyTorch implementation of MaxPoolingLoss. ☆177 · Jun 9, 2018 · Updated 8 years ago
- Customized GPflow with simple TensorFlow API ☆17 · Aug 7, 2019 · Updated 6 years ago
- Code for Switchable Normalization from "Differentiable Learning-to-Normalize via Switchable Normalization", https://arxiv.org/abs/1806.10… ☆869 · Jun 11, 2020 · Updated 5 years ago
- LSTM and QRNN Language Model Toolkit for PyTorch ☆1,990 · Feb 12, 2022 · Updated 4 years ago
- Hybrid Discriminative-Generative Training via Contrastive Learning ☆75 · May 1, 2023 · Updated 3 years ago
- PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM ☆1,262 · Feb 12, 2022 · Updated 4 years ago
- Stochastic Weight Averaging in PyTorch ☆983 · Aug 1, 2021 · Updated 4 years ago
- Code for "Differentiable Compositional Kernel Learning for Gaussian Processes" https://arxiv.org/abs/1806.04326 ☆71 · Jul 10, 2018 · Updated 7 years ago