A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.
☆132Feb 21, 2022Updated 4 years ago
Alternatives and similar repositories for parallax
Users that are interested in parallax are comparing it to the libraries listed below
Sorting:
- Cruise: A Distributed Machine Learning Framework with Automatic System Configuration☆26Mar 19, 2019Updated 6 years ago
- Nemo: A flexible data processing system☆21Mar 12, 2018Updated 7 years ago
- ☆18Dec 4, 2017Updated 8 years ago
- ☆15Oct 4, 2022Updated 3 years ago
- Lightweight and Parallel Deep Learning Framework☆264Nov 26, 2022Updated 3 years ago
- Mirror of Apache REEF☆98Jul 6, 2022Updated 3 years ago
- ☆24Nov 24, 2018Updated 7 years ago
- ☆15Jun 8, 2021Updated 4 years ago
- RDMA Optimization on MXNet☆14Nov 12, 2017Updated 8 years ago
- a TensorFlow-based distributed training framework optimized for large-scale sparse data.☆333Dec 23, 2025Updated 2 months ago
- Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.☆295Feb 23, 2024Updated 2 years ago
- Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.☆12Dec 24, 2016Updated 9 years ago
- ☆12Jul 9, 2021Updated 4 years ago
- A python implementation of the Radiomics approach by Aerts et al (http://www.nature.com/articles/ncomms5006)☆10Mar 22, 2017Updated 8 years ago
- ☆392Nov 4, 2022Updated 3 years ago
- P4 compatible HLS modules☆11Apr 23, 2018Updated 7 years ago
- We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while mai…☆10Nov 14, 2021Updated 4 years ago
- Code for the paper "Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation" by Alexander Levine and Soheil Feizi.☆10Aug 22, 2022Updated 3 years ago
- ☆14Oct 31, 2016Updated 9 years ago
- ☆10Sep 3, 2016Updated 9 years ago
- Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs☆26Mar 9, 2019Updated 6 years ago
- ☆21Jan 7, 2018Updated 8 years ago
- 스누씨 3.0 프론트엔드☆45Mar 11, 2023Updated 2 years ago
- tensorflow extend framework☆13Feb 5, 2020Updated 6 years ago
- TensorFlow Implementation of Several Zero-Shot Image Style Transfer Methods☆15Sep 30, 2017Updated 8 years ago
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated 2 months ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning☆13Nov 1, 2021Updated 4 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆284Dec 17, 2025Updated 2 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- a high performance system for customized-precision distributed deep learning☆12Dec 10, 2020Updated 5 years ago
- (ICPP '20) ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference☆12Jun 22, 2020Updated 5 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- A GPipe implementation in PyTorch☆863Jul 25, 2024Updated last year
- Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.☆30Jan 14, 2021Updated 5 years ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆42Feb 13, 2025Updated last year
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆62Jul 1, 2022Updated 3 years ago
- A morden implement of CHD/SHD algorithm.☆13Apr 25, 2025Updated 10 months ago
- ☆13Jan 23, 2021Updated 5 years ago
- Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599☆58Oct 25, 2018Updated 7 years ago