vineeths96 / Gradient-Compression
We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while maintaining the performance of vanilla SGD. We empirically evaluate the performance of the compression methods by training deep neural networks on the CIFAR10 dataset.
☆10 · Updated 3 years ago
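The description above does not spell out how a compressor can stay all-reduce compatible, so here is a minimal sketch in the PowerSGD style (rank-1 low-rank factors with error feedback): only small factor matrices are summed with a plain all-reduce, never the full gradient. The class name `LowRankCompressor` and its `reduce_gradient` method are hypothetical, the sketch assumes `torch.distributed` is already initialized, and it illustrates the general technique rather than the exact methods implemented in this repository.

```python
# Minimal sketch of an all-reduce compatible gradient compressor
# (PowerSGD-style rank-1 approximation with error feedback).
# Hedged illustration only: names are hypothetical, and torch.distributed
# is assumed to be initialized before use.
import torch
import torch.distributed as dist
import torch.nn.functional as F


class LowRankCompressor:
    def __init__(self):
        self.error = {}  # per-parameter error-feedback memory

    def reduce_gradient(self, name, grad):
        # Error feedback: add back what previous rounds failed to transmit.
        residual = grad + self.error.get(name, torch.zeros_like(grad))
        matrix = residual.reshape(residual.shape[0], -1)

        # Rank-1 factorisation residual ≈ p @ q.T. In practice the random
        # projection q must be seeded identically on every worker.
        q = torch.randn(matrix.shape[1], 1, device=grad.device)
        p = matrix @ q
        dist.all_reduce(p)          # all-reduce the small factor, not the full gradient
        p = F.normalize(p, dim=0)

        q = matrix.t() @ p
        # Store locally whatever the low-rank approximation missed.
        self.error[name] = residual - (p @ q.t()).reshape_as(grad)

        dist.all_reduce(q)
        q /= dist.get_world_size()
        return (p @ q.t()).reshape_as(grad)  # averaged low-rank gradient
```

Because every worker calls the same dense all-reduce on tensors of identical, small shape, a scheme like this drops into a standard data-parallel training loop without any custom sparse communication.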
Alternatives and similar repositories for Gradient-Compression
Users interested in Gradient-Compression are comparing it to the repositories listed below.
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c… ☆26 · Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆35 · Updated 2 years ago
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining ☆12 · Updated last year
- ☆69 · Updated 2 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2… ☆15 · Updated last year
- ☆25 · Updated last year
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning ☆10 · Updated 2 years ago
- [ACM SoCC'22] Pisces: Efficient Federated Learning via Guided Asynchronous Training ☆12 · Updated 2 months ago
- A list of awesome edge-AI inference-related papers. ☆96 · Updated last year
- Official repository for the IPDPS '24 paper "QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices". ☆20 · Updated last year
- An efficient and general framework for layerwise-adaptive gradient compression. ☆15 · Updated last year
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models (ICML 2021). ☆56 · Updated 3 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training ☆222 · Updated last year
- A Sparse-tensor Communication Framework for Distributed Deep Learning ☆13 · Updated 3 years ago
- ☆10 · Updated 4 years ago
- ☆14 · Updated 3 years ago
- [IJCAI 2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte… ☆51 · Updated 2 years ago
- Partial implementation of the paper "Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training". ☆31 · Updated 4 years ago
- THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression ☆19 · Updated 11 months ago
- Create tiny ML systems for on-device learning. ☆20 · Updated 4 years ago
- Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization" ☆34 · Updated last week
- Code for reproducing experiments performed for Accordion. ☆13 · Updated 4 years ago
- ddl-benchmarks: Benchmarks for Distributed Deep Learning ☆37 · Updated 5 years ago
- MobiSys#114 ☆21 · Updated last year
- A computation-parallel deep learning architecture. ☆13 · Updated 5 years ago
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices ☆35 · Updated last year
- Layer-wise Sparsification of Distributed Deep Learning ☆10 · Updated 5 years ago
- Understanding Top-k Sparsification in Distributed Deep Learning (see the Top-k sketch after this list). ☆24 · Updated 5 years ago
- ☆15 · Updated 4 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning ☆140 · Updated 11 months ago
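Several of the repositories above (Ok-Topk, Deep Gradient Compression, and the Top-k sparsification studies) build on Top-k gradient sparsification with error feedback. The sketch below is a minimal illustration of that idea under stated assumptions: the function names `topk_compress` and `topk_exchange` are hypothetical, the naive `all_gather` exchange stands in for the specialized sparse all-reduce those papers propose, and `torch.distributed` is assumed to be initialized.

```python
# Minimal sketch of Top-k gradient sparsification with error feedback.
# Hedged illustration: function names are hypothetical, and the naive
# all_gather exchange stands in for a specialized sparse all-reduce.
import torch
import torch.distributed as dist


def topk_compress(grad, memory, ratio=0.01):
    """Keep the largest `ratio` fraction of entries; stash the rest as error."""
    flat = grad.flatten() + memory            # error feedback: re-add the residual
    k = max(1, int(flat.numel() * ratio))
    _, idx = flat.abs().topk(k)
    values = flat[idx]
    memory = flat.clone()
    memory[idx] = 0                           # transmitted entries leave the residual
    return values, idx, memory


def topk_exchange(values, idx, numel, device):
    """Gather every worker's (value, index) pairs and average into a dense gradient."""
    world = dist.get_world_size()
    all_values = [torch.empty_like(values) for _ in range(world)]
    all_idx = [torch.empty_like(idx) for _ in range(world)]
    dist.all_gather(all_values, values)
    dist.all_gather(all_idx, idx)
    dense = torch.zeros(numel, device=device)
    for v, i in zip(all_values, all_idx):
        dense.index_add_(0, i, v)             # scatter-add each worker's sparse update
    return dense / world
```

Deep Gradient Compression layers refinements such as momentum correction and local gradient clipping on top of this basic Top-k plus error-feedback loop.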