alexrenz / AdaPMLinks

A fully adaptive, zero-tuning parameter manager that enables efficient distributed machine learning training

☆20

Alternatives and similar repositories for AdaPM

Users that are interested in AdaPM are comparing it to the libraries listed below

Sorting:

xldrx / tensorflow-tracer
Runtime Tracing Library for TensorFlow
☆43Updated 6 years ago
dglai / minigun
Light-weight GPU kernel interface for graph operations
☆15Updated 5 years ago
ray-project / ray_shuffling_data_loader
A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…
☆18Updated 2 years ago
SymbioticLab / Fluid
A Generic Resource-Aware Hyperparameter Tuning Execution Engine
☆15Updated 3 years ago
HKBU-HPML / ddl-benchmarks
ddl-benchmarks: Benchmarks for Distributed Deep Learning
☆37Updated 5 years ago
efficient / faiss-learned-termination
Source code for SIGMOD 2020 paper "Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination"
☆55Updated 5 years ago
bytedance / ps-lite
A lightweight parameter server interface
☆77Updated 2 years ago
Funatiq / gossip
gossip: Efficient Communication Primitives for Multi-GPU Systems
☆59Updated 3 years ago
4paradigm / OpenEmbedding
OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.
☆32Updated 2 years ago
cuihenggang / geeps
GPU-specialized parameter server for GPU machine learning.
☆101Updated 7 years ago
realstolz / powerlyra
Differentiated Computation and Partitioning on Skewed (Natural or Bipartite) Graphs
☆66Updated 3 years ago
yingjunwu / IndexZoo
Benchmarking In-Memory Index Structures
☆26Updated 6 years ago
lsds / Crossbow
Crossbow: A Multi-GPU Deep Learning System for Training with Small Batch Sizes
☆55Updated 2 years ago
triton-inference-server / hugectr_backend
☆55Updated last year
byteps / examples
BytePS examples (Vision, NLP, GAN, etc)
☆19Updated 2 years ago
petuum / autodist
Simple Distributed Deep Learning on TensorFlow
☆133Updated last month
K-Wu / pytorch-direct_dgl
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB)
☆44Updated 2 years ago
quiver-team / quiver-feature
High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph
☆54Updated 3 years ago
wheatman / BP-Tree
☆14Updated 5 months ago
KnightKingWalk / KnightKing
A general-purpose, distributed graph random walk engine.
☆109Updated last year
bcaine / learned_indices
A C++11 implementation of the B-Tree part of "The Case for Learned Index Structures"
☆81Updated 7 years ago
uwsampl / nexus
☆82Updated last month
Shekhale / gbnns_dim_red
Reducing Dimensionality method for Nearest Neighbor Search
☆15Updated 4 years ago
CGCL-codes / Tensorflow-RDMA
Tensorflow is a computational library using data flow graphs for scalable machine learning, and Tensorflow-RDMA is the implementation ov…
☆58Updated 2 years ago
TalwalkarLab / paleo
An analytical performance modeling tool for deep neural networks.
☆89Updated 4 years ago
DMALab / Reading_Group
DMALab's reading group slides and papers.
☆16Updated 4 years ago
technicolor-research / quick-adc
Quick ADC
☆26Updated 6 years ago
fzhedu / db-imv
Reducing the cache misses of SIMD vectorization using IMV
☆28Updated 3 years ago
zzy590 / article-code
知乎文章附带代码
☆15Updated 2 years ago
stanford-futuredata / index-baselines
Simple baselines for "Learned Indexes"
☆159Updated 7 years ago