stanford-futuredata / pytorch-distributedLinks
Fork of diux-dev/imagenet18
☆16Updated 6 years ago
Alternatives and similar repositories for pytorch-distributed
Users that are interested in pytorch-distributed are comparing it to the libraries listed below
Sorting:
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 3 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Updated 4 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 5 months ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated 2 years ago
- Python pdb for multiple processes☆44Updated last week
- ICML2019 Accepted Paper. Overcoming Multi-Model Forgetting☆13Updated 6 years ago
- An adaptive training algorithm for residual network☆15Updated 4 years ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 4 years ago
- Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020☆20Updated 5 years ago
- ☆25Updated 5 years ago
- NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for class imbalance).☆33Updated 5 years ago
- Exploiting Uncertainty of Loss Landscape for Stochastic Optimization☆15Updated 6 years ago
- ☆33Updated 6 years ago
- ☆34Updated 6 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 6 years ago
- MTAdam: Automatic Balancing of Multiple Training Loss Terms☆36Updated 4 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Updated 7 years ago
- Visualizing ImageNet Classes Hierarchical Structure.☆15Updated 7 years ago
- Implementation for NATv2.☆23Updated 4 years ago
- Code for the paper "Understanding the Role of Momentum in Stochastic Gradient Methods"☆14Updated 5 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Updated 3 years ago
- A pytorch realization of adafactor (https://arxiv.org/pdf/1804.04235.pdf )☆24Updated 5 years ago
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Updated 6 years ago
- ☆23Updated 6 years ago
- Improving generalization by controlling label-noise information in neural network weights.☆40Updated 4 years ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆24Updated 4 years ago
- Reproducible code for Augmentation paper☆17Updated 6 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated 2 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 6 years ago
- A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes☆20Updated 3 years ago