slowbull / MPIPlatformLinks
A platform for distributed optimization expriments using OpenMPI
☆21Updated 8 years ago
Alternatives and similar repositories for MPIPlatform
Users that are interested in MPIPlatform are comparing it to the libraries listed below
Sorting:
- GPU-specialized parameter server for GPU machine learning.☆102Updated 7 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆193Updated 7 years ago
- FRED simulator and associated paper☆26Updated 10 years ago
- MPI for Torch☆60Updated 8 years ago
- Cyclades☆28Updated 7 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆71Updated 8 years ago
- ☆12Updated 7 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆44Updated 8 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 7 years ago
- MPI Parallel framework for training deep learning models built in Theano☆54Updated 8 years ago
- (Spring 2017) Assignment 2: GPU Executor☆63Updated 8 years ago
- Implements a message passing interface (MPI) wrapper that makes it easy to do massively parallel computations inside the Torch deep-learn…☆110Updated 6 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools☆171Updated 8 years ago
- Papers and blogs related to distributed deep learning☆96Updated 8 years ago
- Proximal Asynchronous SAGA☆13Updated 8 years ago
- LIBBLE by Parameter Server☆17Updated 7 years ago
- cache-friendly multithread matrix factorization☆90Updated 9 years ago
- A primal-dual framework for distributed L1-regularized optimization☆36Updated 9 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 9 years ago
- My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features.☆259Updated 11 years ago
- papers on scalable and efficient machine learning systems☆191Updated 7 years ago
- Proceedings of ICML 2017☆24Updated 3 years ago
- MXNet implementation of Deep Q-learning☆34Updated 8 years ago
- ☆127Updated 9 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 9 years ago
- CUDA Matrix Factorization Library with Alternating Least Square (ALS)☆181Updated 7 years ago
- ☆18Updated 7 years ago
- Hyperparameter optimization with approximate gradient☆66Updated 4 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆300Updated 7 years ago
- Implementation of fast exact k-means algorithms☆45Updated 6 years ago