sgugger / Adam-experiments
Experiments with Adam/AdamW/amsgrad
☆194Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Adam-experiments
- Using the CLR algorithm for training (https://arxiv.org/abs/1506.01186)☆108Updated 6 years ago
- A simpler version of the self-attention layer from SAGAN, and some image classification results.☆212Updated 5 years ago
- Implementations of ideas from recent papers☆391Updated 3 years ago
- Complementary code for the Targeted Dropout paper☆256Updated 5 years ago
- repo that holds code for improving on dropout using Stochastic Delta Rule☆142Updated 5 years ago
- Corrupted labels and label smoothing☆128Updated 7 years ago
- Implements pytorch code for the Accelerated SGD algorithm.☆215Updated 6 years ago
- Smooth Loss Functions for Deep Top-k Classification☆246Updated 3 years ago
- A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch☆123Updated 6 years ago
- PyTorch implementation of the paper Dynamic Routing Between Capsules by Sara Sabour, Nicholas Frosst and Geoffrey Hinton☆169Updated 6 years ago
- Utilities for Pytorch☆89Updated 2 years ago
- Implementation of "Learning with Random Learning Rates" in PyTorch.☆102Updated 5 years ago
- [NO MAINTENANCE INTENDED] A PyTorch implementation of CapsNet architecture in the NIPS 2017 paper "Dynamic Routing Between Capsules".☆169Updated 5 years ago
- Accelerated Deep Learning with PyTorch at Jupyter Day Atlanta II☆127Updated 6 years ago
- 🔥 Reproducibly benchmarking Keras and PyTorch models☆367Updated 3 years ago
- PyTorch implementation of CVPR'18 - Perturbative Neural Networks http://xujuefei.com/pnn.html☆138Updated 5 years ago
- ☆302Updated 3 years ago
- auto-tuning momentum SGD optimizer☆287Updated 5 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆148Updated 7 years ago
- A PyTorch implementation of " EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks."☆309Updated 4 years ago
- Transfer Learning Shootout for PyTorch's model zoo (torchvision)☆170Updated 4 years ago
- All Model summary in PyTorch similar to `model.summary()` in Keras☆87Updated 5 years ago
- PyTorch code for the "Deep Neural Networks with Box Convolutions" paper☆511Updated 4 years ago
- 2.86% and 15.85% on CIFAR-10 and CIFAR-100☆296Updated 6 years ago
- Code for the Eager Translation Model from the paper You May Not Need Attention☆294Updated 5 years ago
- ☆250Updated 8 years ago
- PyTorch implementation of a deep metric learning technique called "Magnet Loss" from Facebook AI Research (FAIR) in ICLR 2016.☆222Updated 9 months ago
- Random miniprojects with pytorch.☆173Updated 6 years ago
- visualization of CNN in PyTorch☆154Updated last year