ag14774 / diffdistLinks
☆62Updated 4 years ago
Alternatives and similar repositories for diffdist
Users that are interested in diffdist are comparing it to the libraries listed below
Sorting:
- ☆43Updated 6 years ago
- "Layer-wise Adaptive Rate Scaling" in PyTorch☆87Updated 4 years ago
- An implementation of shampoo☆74Updated 7 years ago
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 6 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆94Updated 4 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 5 months ago
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆61Updated 4 years ago
- Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition"☆99Updated 4 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 5 years ago
- Code for "Are labels necessary for neural architecture search"☆92Updated last year
- This repository is no longer maintained. Check☆81Updated 5 years ago
- A plug-in replacement for DataLoader to load Imagenet disk-sequentially in PyTorch.☆239Updated 3 years ago
- On Network Design Spaces for Visual Recognition☆95Updated 5 years ago
- Simple experiment of Apex (A PyTorch Extension)☆47Updated 5 years ago
- A Re-implementation of Fixed-update Initialization☆153Updated 5 years ago
- Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934☆113Updated 5 years ago
- models and tools for -What makes ImageNet good for Transfer Learning?☆25Updated 7 years ago
- Minimal API for receptive field calculation in PyTorch☆67Updated 2 years ago
- Zero-Shot Knowledge Distillation in Deep Networks☆67Updated 3 years ago
- Video Noise Contrastive Estimation☆66Updated last year
- A ShuffleBatchNorm layer to shuffle BatchNorm statistics across multiple GPUs☆56Updated 3 years ago
- BlockDrop: Dynamic Inference Paths in Residual Networks☆143Updated 2 years ago
- ☆165Updated 6 years ago
- ☆34Updated 6 years ago
- Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization at CVPR'19☆48Updated 5 years ago
- Code for our paper "Informative Dropout for Robust Representation Learning: A Shape-bias Perspective" (ICML 2020)☆125Updated 2 years ago
- Code for SelfAugment☆27Updated 4 years ago
- Masked Convolutional Flow☆59Updated 5 years ago
- Unofficial implementation of Stand-Alone Self-Attention in Vision Models (obsolete)☆44Updated 5 years ago
- Sparse Switchable Normalization with sparse activation function SparestMax☆64Updated 5 years ago