siddharth9820 / MoDNN
Implementation of algorithms for memory optimized deep neural network training
☆10Updated 4 years ago
Alternatives and similar repositories for MoDNN:
Users that are interested in MoDNN are comparing it to the libraries listed below
- ☆40Updated 4 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Updated last year
- Code for reproducing experiments performed for Accoridon☆13Updated 3 years ago
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining☆12Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated 2 years ago
- Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23)☆9Updated last year
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion☆32Updated 11 months ago
- Machine Learning System☆14Updated 4 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆25Updated 2 years ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆10Updated 10 months ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters