SGD with compressed gradients and error-feedback: https://arxiv.org/abs/1901.09847
☆32Jul 25, 2024Updated last year
Alternatives and similar repositories for error-feedback-SGD
Users that are interested in error-feedback-SGD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599☆58Oct 25, 2018Updated 7 years ago
- Atomo: Communication-efficient Learning via Atomic Sparsification☆28Dec 9, 2018Updated 7 years ago
- ☆33Dec 3, 2019Updated 6 years ago
- Simple Hierarchical Count Sketch in Python☆21Jun 3, 2021Updated 4 years ago
- PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms☆78Aug 30, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727☆150Oct 29, 2024Updated last year
- Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356☆74Sep 10, 2020Updated 5 years ago
- MISSION: Ultra Large-Scale Feature Selection using Count-Sketches☆13Oct 6, 2019Updated 6 years ago
- Code for reproducing experiments performed for Accoridon☆13Jun 11, 2021Updated 4 years ago
- Adaptive gradient sparsification for efficient federated learning: an online learning approach☆18Oct 31, 2020Updated 5 years ago
- Understanding Top-k Sparsification in Distributed Deep Learning☆24Nov 15, 2019Updated 6 years ago
- Layer-wise Sparsification of Distributed Deep Learning☆10Jul 6, 2020Updated 5 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Jul 23, 2024Updated last year
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning"☆17Aug 4, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Stochastic Gradient Push for Distributed Deep Learning☆171Apr 5, 2023Updated 3 years ago
- Q-RR, DIANA-RR, Q-NASTYA, NASTYA-DIANA, QSGD, DIANA, FedCOM and FedPAQ on logistic loss with L2 regularization☆11Nov 1, 2022Updated 3 years ago
- A compressed adaptive optimizer for training large-scale deep learning models using PyTorch☆25Nov 26, 2019Updated 6 years ago
- ☆12Mar 1, 2024Updated 2 years ago
- It is implementation of Research paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING". Deep g…☆18Aug 14, 2019Updated 6 years ago
- The code for the paper "QuAFL: Federated Averaging Can Be Both Asynchronous and Communication-Efficient"☆17Mar 26, 2023Updated 3 years ago
- Communication-efficient decentralized SGD (Pytorch)☆23Mar 17, 2020Updated 6 years ago
- vector quantization for stochastic gradient descent.☆36May 12, 2020Updated 5 years ago
- Code related to ’Beyond spectral gap: The role of the topology in decentralized learning‘.☆14Jun 7, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Partial implementation of paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING"☆32Nov 20, 2020Updated 5 years ago
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- PyTorch implementation of a 9-layer ResNet for CIFAR-10.☆12May 8, 2024Updated last year
- ☆10Jun 4, 2021Updated 4 years ago
- Code for paper "Learning a Code: Machine Learning for Approximate Non-Linear Coded-Computation"☆10Dec 21, 2020Updated 5 years ago
- Implementation of Compressed SGD with Compressed Gradients in Pytorch☆13Jul 25, 2024Updated last year
- A LaTeX template for note☆10May 4, 2023Updated 2 years ago
- Zeroth-order Min-max Optimization☆13Jun 28, 2020Updated 5 years ago
- We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while mai…☆10Nov 14, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An attempt to replicate the paper "Multi-shot Pedestrian Re-identification via Sequential Decision Making (CVPR2018)"☆10Nov 16, 2019Updated 6 years ago
- ☆10Sep 3, 2017Updated 8 years ago
- Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning".…☆13May 4, 2022Updated 3 years ago
- RP-GAN: Stable GAN Training with Random Projections☆22Jun 27, 2018Updated 7 years ago
- The performance of turbo equalizers in both ISI channel and multipath fading channel is evaluated☆11Nov 24, 2020Updated 5 years ago
- Reference implementations for RecurJac, CROWN, FastLin and FastLip (Neural Network verification and robustness certification algorithms)…☆27Nov 23, 2019Updated 6 years ago
- SOTA results for reid baseline model (Gluon implementation)☆13Aug 6, 2018Updated 7 years ago