Lance0218 / Pytorch-DistributedDataParallel-Training-Tricks
A guide that integrates PyTorch DistributedDataParallel, Apex, warmup, and a learning rate scheduler; it also covers the set-up of early stopping and random seeds.
☆65Updated 3 years ago
Alternatives and similar repositories for Pytorch-DistributedDataParallel-Training-Tricks
Users who are interested in Pytorch-DistributedDataParallel-Training-Tricks are comparing it to the libraries listed below:
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…☆310Updated 4 years ago
- Warmup learning rate wrapper for Pytorch Scheduler☆41Updated 5 years ago
- When Does Label Smoothing Help?_pytorch_implementation☆126Updated 6 years ago
- Official PyTorch implementation of Rethinking Self-supervised Learning: Small is Beautiful.☆43Updated 4 years ago
- Official Pytorch implementation of MixMo framework☆83Updated 4 years ago
- This repo is for our paper: Normalization Techniques in Training DNNs: Methodology, Analysis and Application☆85Updated 4 years ago
- MoEx (Moment Exchange)☆141Updated 4 years ago
- ☆140Updated 4 years ago
- [NeurIPS 2021] Official codes for "Efficient Training of Visual Transformers with Small Datasets".☆144Updated last year
- SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition (ICCV 2021)☆98Updated 3 years ago
- Implementation of various Vision Transformers I found interesting☆84Updated 4 years ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆117Updated 3 years ago
- UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning☆57Updated 4 years ago
- [ICCV 2021] Influence-balanced Loss for Imbalanced Visual Classification☆102Updated 3 years ago
- AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning☆114Updated 5 years ago
- Example of PyTorch DistributedDataParallel☆61Updated 4 years ago
- Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)☆75Updated 5 years ago
- ☆135Updated 2 years ago
- Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones☆201Updated 4 years ago
- Official PyTorch implementation of "Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity" (ICLR'21 Oral)☆105Updated 4 years ago
- Self-supervised vIsion Transformer (SiT)☆337Updated 3 years ago
- Implementation of Visual Transformer for Small-size Datasets☆129Updated 3 years ago
- A pytorch implementation of paper 'Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation', …☆180Updated 4 years ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆216Updated 4 years ago
- Unified Pytorch framework for image-based self-supervised learning☆92Updated 5 years ago
- Implementation of ResMLP, an all MLP solution to image classification, in Pytorch☆201Updated 3 years ago
- Official codes: Self-Supervised Learning by Estimating Twin Class Distribution☆100Updated 4 years ago
- Implementations of Recent Papers in Computer Vision☆38Updated 3 years ago
- The official code for the paper "Delving Deep into Label Smoothing", IEEE TIP 2021☆81Updated 3 years ago
- Implementation of Online Label Smoothing in PyTorch☆96Updated 3 years ago