Lance0218 / Pytorch-DistributedDataParallel-Training-Tricks
A guide that integrates Pytorch DistributedDataParallel, Apex, warmup, learning rate scheduler, also mentions the set-up of early-stopping and random seed.
☆60Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Pytorch-DistributedDataParallel-Training-Tricks
- Implementations of Recent Papers in Computer Vision☆39Updated 2 years ago
- Warmup learning rate wrapper for Pytorch Scheduler☆39Updated 4 years ago
- This repository contains some of the latest data augmentation techniques and optimizers for image classification using pytorch and the CI…☆29Updated 3 years ago
- official pytorch implementation of Rethining Self-supervised Learning: Small is Beautiful.☆41Updated 3 years ago
- Implementation of various Vision Transformers I found interesting☆84Updated 3 years ago
- Simple but high-performing method for learning a policy of test-time augmentation☆38Updated last year
- Implementation of Online Label Smoothing in PyTorch☆94Updated 2 years ago
- PyTorch implementation of EMAN for self-supervised and semi-supervised learning: https://arxiv.org/abs/2101.08482☆101Updated 3 years ago
- ☆44Updated 3 years ago
- Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)☆73Updated 4 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch☆55Updated 3 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆97Updated 2 years ago
- UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning☆53Updated 3 years ago
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆88Updated 3 years ago
- PyTorch implementation of MLP-Mixer☆36Updated 3 years ago
- Pytorch implementation of CVPR2021 paper: SuperMix: Supervising the Mixing Data Augmentation☆91Updated 2 years ago
- This repo is for our paper: Normalization Techniques in Training DNNs: Methodology, Analysis and Application☆84Updated 3 years ago
- SoT: Delving Deeper into Classification Head for Transformer☆48Updated 2 years ago
- (NeurIPS 2020 Workshop on SSL) Official Implementation of "MixCo: Mix-up Contrastive Learning for Visual Representation"☆58Updated last year
- This repo contains the code of "ConTNet: Why not use convolution and transformer at the same time?"☆95Updated 3 years ago
- ☆25Updated 4 years ago
- TF 2 implementation Learning to Resize Images for Computer Vision Tasks (https://arxiv.org/abs/2103.09950v1).☆52Updated 3 years ago
- an implementation of mixup☆41Updated 4 years ago
- The official code for the paper "Delving Deep into Label Smoothing", IEEE TIP 2021☆73Updated 2 years ago
- (Unofficial) PyTorch implementation of the paper Early Convolutions Help Transformers See Better☆43Updated 3 years ago
- a pytorch implementation for MoCo V3☆32Updated 3 years ago
- Summary of Transformer applications for computer vision tasks.☆58Updated 3 years ago
- NeurIPS 2021, Official codes for "Efficient Training of Visual Transformers with Small Datasets".☆139Updated last year