sgugger / Adam-experimentsLinks
Experiments with Adam/AdamW/amsgrad
☆201Updated 7 years ago
Alternatives and similar repositories for Adam-experiments
Users that are interested in Adam-experiments are comparing it to the libraries listed below
Sorting:
- Using the CLR algorithm for training (https://arxiv.org/abs/1506.01186)☆108Updated 7 years ago
- Complementary code for the Targeted Dropout paper☆255Updated 6 years ago
- Corrupted labels and label smoothing☆129Updated 8 years ago
- repo that holds code for improving on dropout using Stochastic Delta Rule☆141Updated 6 years ago
- Implementation of "Learning with Random Learning Rates" in PyTorch.☆102Updated 6 years ago
- PyTorch implementation of the paper Dynamic Routing Between Capsules by Sara Sabour, Nicholas Frosst and Geoffrey Hinton☆170Updated 7 years ago
- A simpler version of the self-attention layer from SAGAN, and some image classification results.☆214Updated 6 years ago
- Efficient Data Loading Pipeline in Pure Python☆213Updated 5 years ago
- Implementations of ideas from recent papers☆392Updated 5 years ago
- Implements pytorch code for the Accelerated SGD algorithm.☆215Updated 7 years ago
- AI challenge deadlines with link to software baselines and evaluation results.☆201Updated 6 years ago
- PyTorch implementation of CVPR'18 - Perturbative Neural Networks http://xujuefei.com/pnn.html☆138Updated 7 years ago
- A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch☆124Updated 8 years ago
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Updated 6 years ago
- Simple Tensorflow implementation of "Adaptive Gradient Methods with Dynamic Bound of Learning Rate" (ICLR 2019)☆150Updated 6 years ago
- All Model summary in PyTorch similar to `model.summary()` in Keras☆88Updated 6 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- Smooth Loss Functions for Deep Top-k Classification☆259Updated 4 years ago
- ☆306Updated 4 years ago
- TensorFlow implementation of PNASNet-5 on ImageNet☆101Updated 7 years ago
- This is an example of how to train a MNIST network in Python and run it in c++ with pytorch 1.0☆96Updated 7 years ago
- Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"☆97Updated 5 years ago
- Utilities for Pytorch☆88Updated 3 years ago
- Random miniprojects with pytorch.☆170Updated 7 years ago
- An implementation of shampoo☆78Updated 7 years ago
- This repo houses the new PNN code, along with our responses to the issue raised in the recent Reddit discussion. The code is based on Mic…☆234Updated 6 years ago
- Accelerated Deep Learning with PyTorch at Jupyter Day Atlanta II☆126Updated 7 years ago
- ☆70Updated 9 years ago
- Files to create the figures in the paper "Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates"☆191Updated 8 years ago
- Use TensorFlow efficiently☆96Updated 4 years ago