SGD with compressed gradients and error-feedback: https://arxiv.org/abs/1901.09847
☆32Jul 25, 2024Updated last year
Alternatives and similar repositories for error-feedback-SGD
Users who are interested in error-feedback-SGD are comparing it to the repositories listed below
- QSGD-TF☆21May 15, 2019Updated 6 years ago
- Code for the signSGD paper☆93Jan 12, 2021Updated 5 years ago
- gTop-k S-SGD: A Communication-Efficient Distributed Synchronous SGD Algorithm for Deep Learning☆37Aug 19, 2019Updated 6 years ago
- Atomo: Communication-efficient Learning via Atomic Sparsification☆28Dec 9, 2018Updated 7 years ago
- ☆33Dec 3, 2019Updated 6 years ago
- Adaptive gradient sparsification for efficient federated learning: an online learning approach☆18Oct 31, 2020Updated 5 years ago
- Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727☆149Oct 29, 2024Updated last year
- YALL1: Your ALgorithms for L1☆13Jan 28, 2018Updated 8 years ago
- Layer-wise Sparsification of Distributed Deep Learning☆10Jul 6, 2020Updated 5 years ago
- ☆77Jun 7, 2019Updated 6 years ago
- Understanding Top-k Sparsification in Distributed Deep Learning☆24Nov 15, 2019Updated 6 years ago
- PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms☆78Aug 30, 2021Updated 4 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Jul 23, 2024Updated last year
- Code for reproducing experiments performed for Accordion☆13Jun 11, 2021Updated 4 years ago
- JNumberTools is an open-source Java library for solving complex problems in combinatorics and number theory. Whether you're a researcher,…☆12May 13, 2025Updated 9 months ago
- MISSION: Ultra Large-Scale Feature Selection using Count-Sketches☆13Oct 6, 2019Updated 6 years ago
- Code related to "Beyond spectral gap: The role of the topology in decentralized learning".☆13Jun 7, 2022Updated 3 years ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning"☆17Aug 4, 2020Updated 5 years ago
- ☆14Mar 13, 2023Updated 2 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training☆226Jul 10, 2024Updated last year
- Stochastic Gradient Push for Distributed Deep Learning☆171Apr 5, 2023Updated 2 years ago
- The code for the paper "QuAFL: Federated Averaging Can Be Both Asynchronous and Communication-Efficient"☆17Mar 26, 2023Updated 2 years ago
- An implementation of the research paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING". Deep g…☆18Aug 14, 2019Updated 6 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Nov 19, 2018Updated 7 years ago
- A compressed adaptive optimizer for training large-scale deep learning models using PyTorch☆25Nov 26, 2019Updated 6 years ago
- Reference implementations for RecurJac, CROWN, FastLin and FastLip (Neural Network verification and robustness certification algorithms)…☆27Nov 23, 2019Updated 6 years ago
- The performance of turbo equalizers in both ISI channel and multipath fading channel is evaluated☆11Nov 24, 2020Updated 5 years ago
- A set of tools that make working with the Scala ecosystem even better.☆12Updated this week
- Iterative decoding of turbo codes using the Soft Output Viterbi Algorithm (SOVA)☆10Feb 5, 2021Updated 5 years ago
- The MATLAB code below implements the second-order SPSA (simultaneous perturbation stochastic approximation) and second-order SG (stochast…☆13Nov 27, 2020Updated 5 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆27Dec 10, 2022Updated 3 years ago
- Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.☆30Jan 14, 2021Updated 5 years ago
- Implementation of the FedPM framework by the authors of the ICLR 2023 paper "Sparse Random Networks for Communication-Efficient Federated…☆30Feb 10, 2023Updated 3 years ago
- Coordinate Descent Fuzzy Twin Support Vector Machine for Classification☆11Jan 13, 2018Updated 8 years ago
- My personal site.☆10Jan 20, 2026Updated last month
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Nonconvex Regularized Robust Regression via I-LAMM Algorithm☆11May 9, 2022Updated 3 years ago
- Partial implementation of paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING"☆32Nov 20, 2020Updated 5 years ago
- bayesPop R package☆11Feb 17, 2026Updated last week