ag14774 / diffdist
☆61Updated 4 years ago
Related projects
Alternatives and complementary repositories for diffdist
- ☆42Updated 5 years ago
- "Layer-wise Adaptive Rate Scaling" in PyTorch☆86Updated 3 years ago
- An implementation of shampoo☆74Updated 6 years ago
- Code for paper "SWALP: Stochastic Weight Averaging for Low-Precision Training".☆62Updated 5 years ago
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆60Updated 4 years ago
- This repository is no longer maintained. Check☆82Updated 4 years ago
- Distributed, mixed-precision training with PyTorch☆89Updated 4 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆93Updated 3 years ago
- Code for "Are labels necessary for neural architecture search?"☆92Updated 8 months ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 4 years ago
- A ShuffleBatchNorm layer to shuffle BatchNorm statistics across multiple GPUs☆56Updated 2 years ago
- [NeurIPS 2020 Oral] Is normalization indispensable for training deep neural networks?☆34Updated 2 years ago
- ICML 2020, Estimating Generalization under Distribution Shifts via Domain-Invariant Representations☆21Updated 4 years ago
- PyTorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition?"☆98Updated 3 years ago
- Code for SelfAugment☆27Updated 3 years ago
- Implementation of the reversible residual network in PyTorch☆101Updated 2 years ago
- Improving Generalization via Scalable Neighborhood Component Analysis☆136Updated last year
- A second-order optimizer for deep networks☆24Updated 5 years ago
- A PyTorch implementation of shake-shake☆111Updated 4 years ago
- Custom CUDA kernel for {2, 3}d relative attention with PyTorch wrapper☆43Updated 4 years ago
- On Network Design Spaces for Visual Recognition☆94Updated 4 years ago
- ☆47Updated 3 years ago
- PyTorch Implementations of Dropout Variants☆87Updated 6 years ago
- Efficient reservoir sampling implementation for PyTorch☆104Updated 3 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 11 months ago
- Code for our paper "Informative Dropout for Robust Representation Learning: A Shape-bias Perspective" (ICML 2020)☆125Updated last year
- Minimal API for receptive field calculation in PyTorch☆66Updated 2 years ago
- PyTorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsup…☆17Updated 4 years ago
- Code for reproducing experiments in "How Useful is Self-Supervised Pretraining for Visual Tasks?"☆60Updated 3 months ago