A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.
☆131Feb 21, 2022Updated 4 years ago
Alternatives and similar repositories for parallax
Users that are interested in parallax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Nemo: A flexible data processing system☆21Mar 12, 2018Updated 8 years ago
- Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics☆113Jul 1, 2025Updated 8 months ago
- Welcome to PeriFlow CLI ☁︎☆12Aug 3, 2023Updated 2 years ago
- Lightweight and Parallel Deep Learning Framework☆264Nov 26, 2022Updated 3 years ago
- ☆24Nov 24, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- FriendliAI Model Hub☆90Jun 9, 2022Updated 3 years ago
- RDMA Optimization on MXNet☆14Nov 12, 2017Updated 8 years ago
- ☆15Jun 8, 2021Updated 4 years ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning☆13Nov 1, 2021Updated 4 years ago
- Simple Distributed Deep Learning on TensorFlow☆134Feb 5, 2026Updated last month
- tensorflow extend framework☆13Feb 5, 2020Updated 6 years ago
- We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while mai…☆10Nov 14, 2021Updated 4 years ago
- Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training☆1,864Updated this week
- PyTorch implementation of LAMB for ImageNet/ResNet-50 training☆13May 13, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆23Jun 5, 2019Updated 6 years ago
- A morden implement of CHD/SHD algorithm.☆13Mar 10, 2026Updated 2 weeks ago
- Docker image for Ubuntu 18.04 with cuda 9.0☆10Jan 15, 2019Updated 7 years ago
- a high performance system for customized-precision distributed deep learning☆12Dec 10, 2020Updated 5 years ago
- Simple example showing how to use DGMA in OpenCL☆13Feb 11, 2016Updated 10 years ago
- Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.☆30Jan 14, 2021Updated 5 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Nov 19, 2018Updated 7 years ago
- A GPipe implementation in PyTorch☆862Jul 25, 2024Updated last year
- ☆392Nov 4, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- gTop-k S-SGD: A Communication-Efficient Distributed Synchronous SGD Algorithm for Deep Learning☆37Aug 19, 2019Updated 6 years ago
- Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599☆58Oct 25, 2018Updated 7 years ago
- Code for the paper "Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation" by Alexander Levine and Soheil Feizi.☆10Aug 22, 2022Updated 3 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆284Dec 17, 2025Updated 3 months ago
- A lightweight parameter server interface☆88Jan 13, 2023Updated 3 years ago
- A Distributed Camera System for Inference Scheduling and Continuous Learning in Video Analytics☆18Jun 26, 2023Updated 2 years ago
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated 3 months ago
- ☆21Nov 29, 2022Updated 3 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Jul 23, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- (ICPP '20) ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference☆12Jun 22, 2020Updated 5 years ago
- ☆102Jul 2, 2023Updated 2 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆740Jan 26, 2023Updated 3 years ago
- A high performance and generic framework for distributed DNN training☆3,716Oct 3, 2023Updated 2 years ago
- deepx_core是一个专注于张量计算/深度学习的基础库☆380Apr 15, 2025Updated 11 months ago
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…☆16Oct 7, 2024Updated last year