alexrenz / AdaPM
A fully adaptive, zero-tuning parameter manager that enables efficient distributed machine learning training
☆19Updated last year
Alternatives and similar repositories for AdaPM:
Users that are interested in AdaPM are comparing it to the libraries listed below
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 2 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing☆19Updated last year
- Light-weight GPU kernel interface for graph operations☆15Updated 4 years ago
- C++ source code for the Dynamic Index algorithm proposed in "Efficient Similarity Computation for Collaborative Filtering in Dynamic Envi…☆16Updated 5 years ago
- Source code for SIGMOD 2020 paper "Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination"☆49Updated 4 years ago
- Machine Learning System☆14Updated 4 years ago
- Reducing the cache misses of SIMD vectorization using IMV☆27Updated 2 years ago
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆31Updated last year
- Trisk on Flink☆16Updated 2 years ago
- Runtime Tracing Library for TensorFlow☆43Updated 6 years ago
- Benchmarking In-Memory Index Structures☆26Updated 6 years ago
- ☆51Updated last year
- ☆14Updated last week
- A C++11 implementation of the B-Tree part of "The Case for Learned Index Structures"☆80Updated 7 years ago
- A graph application benchmark suite☆21Updated 9 years ago
- BytePS examples (Vision, NLP, GAN, etc)☆19Updated 2 years ago
- DMALab's reading group slides and papers.☆16Updated 3 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆49Updated 2 years ago
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆58Updated 2 years ago
- ☆15Updated last year
- ☆21Updated 2 years ago
- Vector search with bounded performance.☆34Updated last year
- Parameter Server implementation in Apache Flink.☆14Updated 6 years ago
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine☆15Updated 3 years ago
- PyTorch-Direct code on top of PyTorch-1.8.0nightly (e152ca5) for Large Graph Convolutional Network Training with GPU-Oriented Data Commun…☆45Updated last year
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 8 years ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- ddl-benchmarks: Benchmarks for Distributed Deep Learning☆37Updated 4 years ago
- SnailTrail implementation☆38Updated 5 years ago
- To Index or Not to Index: Optimizing Exact Maximum Inner Product Search☆26Updated 5 years ago