Lance0218 / Pytorch-DistributedDataParallel-Training-TricksLinks

A guide that integrates Pytorch DistributedDataParallel, Apex, warmup, learning rate scheduler, also mentions the set-up of early-stopping and random seed.

☆65

Alternatives and similar repositories for Pytorch-DistributedDataParallel-Training-Tricks

Users that are interested in Pytorch-DistributedDataParallel-Training-Tricks are comparing it to the libraries listed below

Sorting:

lesliejackson / PyTorch-Distributed-Training
Example of PyTorch DistributedDataParallel
☆60Updated 4 years ago
seominseok0429 / label-smoothing-visualization-pytorch
When Does Label Smoothing Help?_pytorch_implementationimp
☆126Updated 5 years ago
lehduong / torch-warmup-lr
Warmup learning rate wrapper for Pytorch Scheduler
☆41Updated 5 years ago
alexrame / mixmo-pytorch
Official Pytorch implementation of MixMo framework
☆84Updated 4 years ago
zhangchbin / OnlineLabelSmoothing
The official code for the paper "Delving Deep into Label Smoothing", IEEE TIP 2021
☆81Updated 3 years ago
snu-mllab / Co-Mixup
Official PyTorch implementation of "Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity" (ICLR'21 Oral)
☆105Updated 4 years ago
pseulki / IB-Loss
[ICCV 2021] Influence-balanced Loss for Imbalanced Visual Classification
☆102Updated 3 years ago
huangleiBuaa / NormalizationSurvey
This repo is for our paper: Normalization Techniques in Training DNNs: Methodology, Analysis and Application
☆85Updated 4 years ago
zhoudaquan / dvit_repo
☆140Updated 3 years ago
haofanwang / awesome-mlp-papers
Recent Advances in MLP-based Models (MLP is all you need!)
☆117Updated 2 years ago
lucidrains / transformer-in-transformer
Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…
☆310Updated 3 years ago
rishikksh20 / convolution-vision-transformers
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers
☆229Updated 4 years ago
CupidJay / Scaled-down-self-supervised-learning
official pytorch implementation of Rethining Self-supervised Learning: Small is Beautiful.
☆43Updated 4 years ago
rosinality / vision-transformers-pytorch
Implementation of various Vision Transformers I found interesting
☆84Updated 4 years ago
ludvb / batchrenorm
Batch Renormalization in Pytorch
☆45Updated 2 years ago
kuixu / Linear-Multihead-Attention
Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)
☆75Updated 5 years ago
Boyiliee / MoEx
MoEx (Moment Exchange)
☆141Updated 4 years ago
etetteh / sota-data-augmentation-and-optimizers
This repository contains some of the latest data augmentation techniques and optimizers for image classification using pytorch and the CI…
☆29Updated 4 years ago
ankandrew / online-label-smoothing-pt
Implementation of Online Label Smoothing in PyTorch
☆95Updated 3 years ago
tczhangzhi / awesome-normalization-techniques
Papers for normalization techniques, released codes collections.
☆228Updated 5 years ago
SHI-Labs / Semi-Supervised-Transfer-Learning
[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
☆106Updated 4 years ago
snu-mllab / PuzzleMix
Official PyTorch implementation of "Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup" (ICML'20)
☆156Updated 4 years ago
yan-hao-tian / ConTNet
This repo contains the code of "ConTNet: Why not use convolution and transformer at the same time?"
☆98Updated 4 years ago
Spijkervet / BYOL
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
☆133Updated 3 years ago
blakechi / ComVEX
Implementations of Recent Papers in Computer Vision
☆38Updated 3 years ago
rishikksh20 / MLP-Mixer-pytorch
Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision
☆217Updated 4 years ago
amazon-science / crossnorm-selfnorm
CrossNorm and SelfNorm for Generalization under Distribution Shifts, ICCV 2021
☆128Updated 4 years ago
bytedance / TWIST
Official codes: Self-Supervised Learning by Estimating Twin Class Distribution
☆100Updated 4 years ago
lucidrains / halonet-pytorch
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
☆200Updated 4 years ago
matej-ulicny / harmonic-networks
☆60Updated 3 years ago