amplab / cycladesLinks
Cyclades
☆28Updated 7 years ago
Alternatives and similar repositories for cyclades
Users that are interested in cyclades are comparing it to the libraries listed below
Sorting:
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆193Updated 7 years ago
- GPU-specialized parameter server for GPU machine learning.☆102Updated 7 years ago
- (Spring 2017) Assignment 2: GPU Executor☆63Updated 8 years ago
- Papers and blogs related to distributed deep learning☆96Updated 8 years ago
- cache-friendly multithread matrix factorization☆90Updated 9 years ago
- MPI for Torch☆60Updated 8 years ago
- ☆18Updated 7 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆44Updated 8 years ago
- FRED simulator and associated paper☆26Updated 9 years ago
- Scalable and Sustainable Deep Learning via Randomized Hashing☆94Updated 3 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13Updated 10 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 9 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆71Updated 8 years ago
- DrMAD☆107Updated 8 years ago
- ☆76Updated 9 years ago
- A set of distributed learning algorithms for Torch☆93Updated 4 years ago
- papers on scalable and efficient machine learning systems☆192Updated 7 years ago
- CUDA Matrix Factorization Library with Alternating Least Square (ALS)☆181Updated 7 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 9 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆110Updated 8 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 7 years ago
- Cython implementation of k-MC2 and AFK-MC2 seeding☆207Updated 4 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 9 years ago
- Implements a message passing interface (MPI) wrapper that makes it easy to do massively parallel computations inside the Torch deep-learn…☆110Updated 6 years ago
- Original Python version of Intel® Nervana™ Graph☆214Updated 3 years ago
- A primal-dual framework for distributed L1-regularized optimization☆36Updated 9 years ago
- A platform for distributed optimization expriments using OpenMPI☆21Updated 8 years ago
- Hyperparameter optimization with approximate gradient☆66Updated 4 years ago
- Multi-armed bandit simulation library☆140Updated 2 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools☆171Updated 8 years ago