Soptq / Dynamic_Load_Balance_DistributedDNN
Official PyTorch implementation of "DBS: Dynamic Batch Size for Distributed Deep Neural Network Training"
☆22 · Updated 3 years ago
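The core idea named in the title is to rebalance per-worker batch sizes between epochs so that faster workers take a larger share of a fixed global batch. Below is a minimal sketch of that rebalancing step, assuming an initialized `torch.distributed` process group; the function name and structure are illustrative, not this repository's actual API.

```python
# Minimal sketch of dynamic-batch-size rebalancing (illustrative only,
# not this repository's API). Each worker reports its measured throughput;
# the fixed global batch is then re-split in proportion to worker speed.
import torch
import torch.distributed as dist

def rebalance_batch_size(samples_done: int, epoch_seconds: float,
                         global_batch_size: int) -> int:
    """Return this worker's batch size for the next epoch."""
    world_size = dist.get_world_size()
    # Gather every worker's throughput (samples processed per second).
    mine = torch.tensor([samples_done / epoch_seconds])
    speeds = [torch.zeros(1) for _ in range(world_size)]
    dist.all_gather(speeds, mine)
    speeds = torch.cat(speeds)
    # Split the fixed global batch in proportion to relative speed, so all
    # workers finish a step at roughly the same time (less straggler wait).
    share = speeds[dist.get_rank()] / speeds.sum()
    return max(1, int(round(share.item() * global_batch_size)))
```

Keeping the global batch size constant as the split shifts between workers means the learning-rate schedule does not have to change.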
Alternatives and similar repositories for Dynamic_Load_Balance_DistributedDNN:
Users interested in Dynamic_Load_Balance_DistributedDNN are comparing it to the repositories listed below
- ☆45 · Updated 4 years ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021 ☆55 · Updated 3 years ago
- Partial implementation of the paper "Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training" (see the top-k sketch after this list) ☆31 · Updated 4 years ago
- Implementation of (overlap) local SGD in PyTorch ☆33 · Updated 4 years ago
- FedNAS: Federated Deep Learning via Neural Architecture Search ☆53 · Updated 3 years ago
- Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727 ☆146 · Updated 3 months ago
- Dynamic Channel Pruning: Feature Boosting and Suppression ☆17 · Updated 5 years ago
- Implementation of Parameter Server using PyTorch communication lib ☆43 · Updated 5 years ago
- Vector quantization for stochastic gradient descent. ☆33 · Updated 4 years ago
- Code for the paper "Few Shot Network Compression via Cross Distillation", AAAI 2020. ☆30 · Updated 5 years ago
- Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020. ☆29 · Updated 4 years ago
- Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599 ☆58 · Updated 6 years ago
- Federated Learning Framework Benchmark (UniFed) ☆49 · Updated last year
- [IJCAI 2023] An automated parallel training system that combines the advantages of both data and model parallelism. If you have any inte… ☆51 · Updated last year
- GRACE - GRAdient ComprEssion for distributed deep learning ☆140 · Updated 6 months ago
- ☆22 · Updated 4 years ago
- [ICLR 2020] Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers. ☆31 · Updated 5 years ago
- MiLeNAS: Efficient Neural Architecture Search via Mixed-Level Reformulation. Published in CVPR 2020 ☆36 · Updated 4 years ago
- [NeurIPS 2021] "MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge", Geng Yuan, Xiaolong Ma, Yanzhi Wang et al… ☆18 · Updated 2 years ago
- Code for paper: Variance Reduced Local SGD with Lower Communication Complexity ☆12 · Updated 4 years ago
- ☆40 · Updated 3 years ago
- XNAS: An effective, modular, and flexible Neural Architecture Search (NAS) framework. ☆47 · Updated 2 years ago
- Reaches 77% accuracy. Large-batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional acc… ☆38 · Updated 3 years ago
- Distributed ML Training Benchmarks ☆27 · Updated last year
- Distributed DataLoader for PyTorch based on Ray ☆24 · Updated 3 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De… ☆46 · Updated last year
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices ☆29 · Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆34 · Updated 2 years ago
- Code for the ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees" ☆18 · Updated last year
- ddl-benchmarks: Benchmarks for Distributed Deep Learning ☆37 · Updated 4 years ago
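Several entries above (Deep Gradient Compression, sparsified SGD with memory, GRACE, gradient quantization) revolve around compressing gradients before communication. As a reference point, here is a minimal top-k sparsification sketch with error feedback; it illustrates the general technique under my own naming and is not code from any of the listed repositories.

```python
# Minimal top-k gradient sparsification with error feedback (illustrative
# sketch of the general technique; not code from the repositories above).
import torch

class TopKCompressor:
    def __init__(self, ratio: float = 0.01):
        self.ratio = ratio    # fraction of gradient entries to transmit
        self.residual = {}    # per-tensor memory of untransmitted mass

    def compress(self, name: str, grad: torch.Tensor):
        flat = grad.flatten()
        if name in self.residual:
            flat = flat + self.residual[name]   # error feedback: re-add leftovers
        k = max(1, int(self.ratio * flat.numel()))
        _, idx = torch.topk(flat.abs(), k)      # k largest-magnitude entries
        vals = flat[idx]
        rest = flat.clone()
        rest[idx] = 0.0
        self.residual[name] = rest              # remember what was dropped
        return vals, idx                        # sparse payload to send

    @staticmethod
    def decompress(vals, idx, shape):
        flat = torch.zeros(torch.Size(shape).numel())
        flat[idx] = vals
        return flat.view(shape)

# Usage: compress a gradient, ship (vals, idx), reconstruct on the receiver.
comp = TopKCompressor(ratio=0.25)
g = torch.randn(8, 8)
vals, idx = comp.compress("layer.weight", g)
g_hat = comp.decompress(vals, idx, g.shape)
```

The error-feedback residual is what keeps aggressive sparsification from losing gradient mass: whatever is dropped this step is carried forward and eventually transmitted.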