snuspl / parallaxLinks

A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.

☆132

Alternatives and similar repositories for parallax

Users that are interested in parallax are comparing it to the libraries listed below

Sorting:

petuum / autodist
Simple Distributed Deep Learning on TensorFlow
☆133Updated last month
TalwalkarLab / paleo
An analytical performance modeling tool for deep neural networks.
☆89Updated 4 years ago
ucla-labx / distbelief
Implementing Google's DistBelief paper
☆111Updated 2 years ago
mlcommons / training_results_v0.7
This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.
☆57Updated 2 years ago
xldrx / tensorflow-tracer
Runtime Tracing Library for TensorFlow
☆43Updated 6 years ago
VoVAllen / tf-dlpack
DLPack for Tensorflow
☆35Updated 5 years ago
snuspl / nimble
Lightweight and Parallel Deep Learning Framework
☆264Updated 2 years ago
parasj / checkmate
Training neural networks in TensorFlow 2.0 with 5x less memory
☆132Updated 3 years ago
octoml / synr
A library for syntactically rewriting Python programs, pronounced (sinner).
☆69Updated 3 years ago
NVIDIA / nvtx-plugins
Python bindings for NVTX
☆66Updated 2 years ago
Distributed-AI / PipeTransformer
PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021
☆56Updated 4 years ago
tvmai / meetup-slides
Place for meetup slides
☆141Updated 4 years ago
hwang595 / ps_pytorch
implement distributed machine learning with Pytorch + OpenMPI
☆51Updated 6 years ago
cuihenggang / geeps
GPU-specialized parameter server for GPU machine learning.
☆101Updated 7 years ago
awslabs / lorien
☆43Updated last year
mlcommons / training_results_v0.5
This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.
☆35Updated 2 months ago
kanonjz / paper
Machine Learning System
☆14Updated 5 years ago
pytorch / tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
☆259Updated 2 years ago
mlcommons / inference_policies
Issues related to MLPerf™ Inference policies, including rules and suggested changes
☆63Updated last week
HKBU-HPML / ddl-benchmarks
ddl-benchmarks: Benchmarks for Distributed Deep Learning
☆37Updated 5 years ago
dmlc / nnvm-fusion
Kernel Fusion and Runtime Compilation Based on NNVM
☆70Updated 8 years ago
dlsys-course / tinyflow
Tutorial code on how to build your own Deep Learning System in 2k Lines
☆125Updated 8 years ago
spcl / substation
Research and development for optimizing transformers
☆129Updated 4 years ago
facebookresearch / FBTT-Embedding
This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …
☆194Updated 3 years ago
ucbrise / cs294-ai-sys-sp19
CS294; AI For Systems and Systems For AI
☆224Updated 5 years ago
facebookresearch / fairring
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …
☆65Updated 3 years ago
jwfromm / Riptide
Simple Training and Deployment of Fast End-to-End Binary Networks
☆157Updated 3 years ago
facebookresearch / DLRM-FlexFlow
Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…
☆29Updated 3 years ago
mcanini / SysML-reading-list
Systems for ML/AI & ML/AI for Systems paper reading list: A curated reading list of computer science research for work at the intersectio…
☆278Updated last month
swsnu / bd2018
☆24Updated 6 years ago