timgaripov / swa
Stochastic Weight Averaging in PyTorch
☆975 · Updated 4 years ago
Alternatives and similar repositories for swa
Users who are interested in swa are comparing it to the libraries listed below.
- mixup: Beyond Empirical Risk Minimization ☆1,191 · Updated 3 years ago
- 2.56%, 15.20%, 1.30% on CIFAR10, CIFAR100, and SVHN https://arxiv.org/abs/1708.04552 ☆553 · Updated 5 years ago
- AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty ☆988 · Updated 4 months ago
- lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch ☆336 · Updated 6 years ago
- Efficient Learning of Augmentation Policy Schedules ☆507 · Updated 5 years ago
- Implementations of ideas from recent papers ☆392 · Updated 4 years ago
- ☆1,144 · Updated 2 years ago
- ☆536 · Updated 3 years ago
- Unofficial implementation of the ImageNet, CIFAR 10 and SVHN Augmentation Policies learned by AutoAugment using pillow ☆1,489 · Updated 2 years ago
- A New Optimization Technique for Deep Neural Networks ☆540 · Updated 3 years ago
- Official Implementation of 'Fast AutoAugment' in PyTorch. ☆1,610 · Updated 4 years ago
- Over9000 optimizer ☆424 · Updated 2 years ago
- Fine-tune pretrained Convolutional Neural Networks with PyTorch ☆725 · Updated last year
- Code snippets created for the PyTorch discussion board ☆571 · Updated 4 years ago
- Implementation of DropBlock: A regularization method for convolutional networks in PyTorch. ☆596 · Updated 5 years ago
- Weakly Supervised Learning On Images ☆601 · Updated 3 years ago
- Standardizing weights to accelerate micro-batch training ☆550 · Updated 3 years ago
- Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples" ☆800 · Updated last year
- A PyTorch implementation of "EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks." ☆313 · Updated 5 years ago
- Code for Noisy Student Training. https://arxiv.org/abs/1911.04252 ☆765 · Updated 4 years ago
- Code for reproducing Manifold Mixup results (ICML 2019) ☆493 · Updated last year
- Code for Switchable Normalization from "Differentiable Learning-to-Normalize via Switchable Normalization", https://arxiv.org/abs/1806.10… ☆869 · Updated 5 years ago
- Gradually-Warmup Learning Rate Scheduler for PyTorch ☆992 · Updated 11 months ago
- Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019 ☆612 · Updated 4 years ago
- Implementation of the mixup training method ☆466 · Updated 7 years ago
- Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL) ☆867 · Updated 3 years ago
- 🛠 Toolbox to extend PyTorch functionalities ☆419 · Updated last year
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase ☆1,203 · Updated last year
- This repository reproduces the results of the paper: "Fixing the train-test resolution discrepancy" https://arxiv.org/abs/1906.06423 ☆1,045 · Updated 4 years ago
- Wide Residual Networks (WideResNets) in PyTorch ☆343 · Updated 4 years ago