sgugger / Adam-experimentsLinks
Experiments with Adam/AdamW/amsgrad
☆201Updated 7 years ago
Alternatives and similar repositories for Adam-experiments
Users that are interested in Adam-experiments are comparing it to the libraries listed below
Sorting:
- Corrupted labels and label smoothing☆129Updated 8 years ago
- Using the CLR algorithm for training (https://arxiv.org/abs/1506.01186)☆108Updated 7 years ago
- repo that holds code for improving on dropout using Stochastic Delta Rule☆141Updated 6 years ago
- Complementary code for the Targeted Dropout paper☆255Updated 6 years ago
- PyTorch implementation of the paper Dynamic Routing Between Capsules by Sara Sabour, Nicholas Frosst and Geoffrey Hinton☆170Updated 7 years ago
- Implements pytorch code for the Accelerated SGD algorithm.☆215Updated 7 years ago
- Implementations of ideas from recent papers☆392Updated 4 years ago
- A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch☆124Updated 7 years ago
- Files to create the figures in the paper "Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates"☆191Updated 7 years ago
- Compare outputs between layers written in Tensorflow and layers written in Pytorch☆72Updated 7 years ago
- Implementation of "Learning with Random Learning Rates" in PyTorch.☆102Updated 6 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- A simple, flexible, and extensible template for PyTorch. It's beautiful.☆191Updated 2 years ago
- All Model summary in PyTorch similar to `model.summary()` in Keras☆88Updated 6 years ago
- Accelerated Deep Learning with PyTorch at Jupyter Day Atlanta II☆126Updated 7 years ago
- Plot loss and accuracy of neural networks over time☆63Updated 8 years ago
- Snapshot Ensembles in Torch (Snapshot Ensembles: Train 1, Get M for Free)☆188Updated 8 years ago
- PyTorch implementation of a deep metric learning technique called "Magnet Loss" from Facebook AI Research (FAIR) in ICLR 2016.☆219Updated last year
- This repo houses the new PNN code, along with our responses to the issue raised in the recent Reddit discussion. The code is based on Mic…☆234Updated 6 years ago
- This is an example of how to train a MNIST network in Python and run it in c++ with pytorch 1.0☆96Updated 7 years ago
- TensorFlow implementation of PNASNet-5 on ImageNet☆101Updated 6 years ago
- Efficient Data Loading Pipeline in Pure Python☆212Updated 5 years ago
- A simpler version of the self-attention layer from SAGAN, and some image classification results.☆214Updated 6 years ago
- My personal toolkit for PyTorch development.☆129Updated 8 years ago
- Decoupled Weight Decay Regularization (ICLR 2019)☆284Updated 6 years ago
- Smooth Loss Functions for Deep Top-k Classification☆258Updated 4 years ago
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Updated 6 years ago
- Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"☆97Updated 5 years ago
- AI challenge deadlines with link to software baselines and evaluation results.☆201Updated 5 years ago
- Reviewing recent advances in classification on CIFAR 10 and 100 datasets☆37Updated 7 years ago