A Sparse-tensor Communication Framework for Distributed Deep Learning
☆13Nov 1, 2021Updated 4 years ago
Alternatives and similar repositories for DeepReduce
Users that are interested in DeepReduce are comparing it to the libraries listed below
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆27Dec 10, 2022Updated 3 years ago
- ☆10Jun 4, 2021Updated 4 years ago
- Code for reproducing experiments performed for Accordion☆13Jun 11, 2021Updated 4 years ago
- We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while mai…☆10Nov 14, 2021Updated 4 years ago
- PaddlePaddle transfer learning algorithm library☆20Feb 20, 2023Updated 3 years ago
- ☆15Jul 28, 2021Updated 4 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Sep 21, 2023Updated 2 years ago
- ☆11Oct 25, 2023Updated 2 years ago
- Contributions to PaddlePaddle from third-party developers☆20Nov 30, 2022Updated 3 years ago
- ☆69Mar 14, 2023Updated 3 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- Reducing P4 Language’s Voluminosity using Higher-Level Constructs☆15Oct 15, 2022Updated 3 years ago
- Poise source code repo☆12Aug 12, 2020Updated 5 years ago
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- Proximal Asynchronous SAGA☆13Nov 30, 2017Updated 8 years ago
- Code for paper "Learning a Code: Machine Learning for Approximate Non-Linear Coded-Computation"☆11Dec 21, 2020Updated 5 years ago
- Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- SC 2021, "LogECMem: Coupling Erasure-Coded In-Memory Key-Value Stores with Parity Logging"☆12Jul 12, 2021Updated 4 years ago
- An attempt to replicate the paper "Multi-shot Pedestrian Re-identification via Sequential Decision Making (CVPR2018)"☆10Nov 16, 2019Updated 6 years ago
- Notes and work-in-progress for BPF-related research projects☆12Jan 10, 2025Updated last year
- PyTorch implementation of LAMB for ImageNet/ResNet-50 training☆13May 13, 2021Updated 4 years ago
- A parallel programming model for online applications with complex synchronization requirements.☆16Jun 8, 2022Updated 3 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Jul 23, 2024Updated last year
- ☆13Mar 27, 2019Updated 6 years ago
- Compressing weather and climate data into neural networks☆17May 19, 2024Updated last year
- SOTA results for reid baseline model (Gluon implementation)☆13Aug 6, 2018Updated 7 years ago
- A compressed adaptive optimizer for training large-scale deep learning models using PyTorch☆25Nov 26, 2019Updated 6 years ago
- ☆10Apr 20, 2025Updated 11 months ago
- THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression☆19Jul 30, 2024Updated last year
- A Learnable LSH Framework for Efficient NN Training☆34Jul 22, 2021Updated 4 years ago
- Sparsity support for PyTorch☆38Mar 22, 2025Updated last year
- Pressio is Latin for compression. Libpressio is a C++ library with C compatible bindings to abstract between different lossless and lossy…☆16Dec 30, 2024Updated last year
- Multi-index hashing for the resolution of ANN search problem on large datasets☆15Oct 16, 2018Updated 7 years ago
- ☆16Apr 22, 2025Updated 11 months ago
- ☆10Jul 30, 2021Updated 4 years ago
- ☆27Dec 22, 2024Updated last year
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining☆12Dec 4, 2023Updated 2 years ago
- Implementation of TCP connection tracking in eBPF☆14May 9, 2024Updated last year
- ☆64Jun 25, 2024Updated last year