PanZaifeng / G-SLIDE
☆14Updated 3 years ago
Alternatives and similar repositories for G-SLIDE:
Users that are interested in G-SLIDE are comparing it to the libraries listed below
- ☆15Updated 2 years ago
- A Learnable LSH Framework for Efficient NN Training☆31Updated 3 years ago
- Distributed DataLoader For Pytorch Based On Ray☆24Updated 3 years ago
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining☆12Updated last year
- A study of the downstream instability of word embeddings☆12Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated 11 months ago
- Distributed ML Optimizer☆30Updated 3 years ago
- Confident Adaptive Transformers☆12Updated 3 years ago
- Hyperparameter tuning via uncertainty modeling☆47Updated 9 months ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆29Updated 2 weeks ago
- Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020☆20Updated 5 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 3 years ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆12Updated 3 years ago
- A Python library transfers PyTorch tensors between CPU and NVMe☆103Updated 2 months ago
- ☆14Updated 2 years ago
- ☆19Updated last year
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021