HKBU-HPML / OMGS-SGDView external linksLinks
Layer-wise Sparsification of Distributed Deep Learning
☆10Jul 6, 2020Updated 5 years ago
Alternatives and similar repositories for OMGS-SGD
Users that are interested in OMGS-SGD are comparing it to the libraries listed below
Sorting:
- MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning☆12Apr 26, 2021Updated 4 years ago
- A compressed adaptive optimizer for training large-scale deep learning models using PyTorch☆25Nov 26, 2019Updated 6 years ago
- Code for reproducing experiments performed for Accoridon☆13Jun 11, 2021Updated 4 years ago
- A computation-parallel deep learning architecture.☆13Sep 25, 2019Updated 6 years ago
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining☆12Dec 4, 2023Updated 2 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Sep 21, 2023Updated 2 years ago
- gTop-k S-SGD: A Communication-Efficient Distributed Synchronous SGD Algorithm for Deep Learning☆37Aug 19, 2019Updated 6 years ago
- High performance NCCL plugin for Bagua.☆15Sep 15, 2021Updated 4 years ago
- BytePS examples (Vision, NLP, GAN, etc)☆19Nov 24, 2022Updated 3 years ago
- Simple Hierarchical Count Sketch in Python☆21Jun 3, 2021Updated 4 years ago
- QSGD-TF☆21May 15, 2019Updated 6 years ago
- Understanding Top-k Sparsification in Distributed Deep Learning☆24Nov 15, 2019Updated 6 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆30Feb 4, 2025Updated last year
- Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599☆58Oct 25, 2018Updated 7 years ago
- Stochastic Gradient Push for Distributed Deep Learning☆170Apr 5, 2023Updated 2 years ago
- Atomo: Communication-efficient Learning via Atomic Sparsification☆28Dec 9, 2018Updated 7 years ago
- SGD with compressed gradients and error-feedback: https://arxiv.org/abs/1901.09847☆32Jul 25, 2024Updated last year
- Sketched SGD☆28Jul 4, 2020Updated 5 years ago
- Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356☆75Sep 10, 2020Updated 5 years ago
- PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms☆78Aug 30, 2021Updated 4 years ago
- ddl-benchmarks: Benchmarks for Distributed Deep Learning☆36May 29, 2020Updated 5 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Jul 23, 2024Updated last year
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks.☆14Oct 3, 2022Updated 3 years ago
- netbeacon - monitoring your network capture, NIDS or network analysis process☆19Oct 26, 2013Updated 12 years ago
- 🕹 Implementation for the lesson Compiling Engineering(2020 Spring) in Peking University, adjusted from UCLA CS 132 Project.☆10Jun 21, 2020Updated 5 years ago
- Peking University Convex Optimization Course given by Professor Wen Zaiwen☆11Jan 11, 2018Updated 8 years ago
- FPGA-based HyperLogLog Accelerator☆12Jul 13, 2020Updated 5 years ago
- FalkorDB port to Rust☆12Jul 29, 2025Updated 6 months ago
- ☆11Mar 13, 2023Updated 2 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- Proposal for the next generation of course-oriented IR.☆10Dec 24, 2021Updated 4 years ago
- ☆10May 16, 2021Updated 4 years ago
- ☆11Apr 3, 2023Updated 2 years ago
- A Coq framework to support structural design and proof of hardware cache-coherence protocols☆14May 7, 2022Updated 3 years ago
- This is the numerical approach proposed in the paper "Optimal Incentives to Mitigate Epidemics: A Stackelberg Mean Field Game Approach" b…☆12Nov 22, 2021Updated 4 years ago
- ☆15Jul 18, 2023Updated 2 years ago
- ☆13Jan 21, 2022Updated 4 years ago
- Chaitin-Briggs register-allocation algorithm (LLVM back-end)☆12Jan 6, 2016Updated 10 years ago