amplab / cycladesLinks
Cyclades
☆28Updated 7 years ago
Alternatives and similar repositories for cyclades
Users that are interested in cyclades are comparing it to the libraries listed below
Sorting:
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆193Updated 7 years ago
- GPU-specialized parameter server for GPU machine learning.☆102Updated 7 years ago
- Papers and blogs related to distributed deep learning☆96Updated 8 years ago
- MPI for Torch☆60Updated 8 years ago
- cache-friendly multithread matrix factorization☆90Updated 9 years ago
- (Spring 2017) Assignment 2: GPU Executor☆63Updated 8 years ago
- ☆18Updated 7 years ago
- FRED simulator and associated paper☆26Updated 9 years ago
- CUDA Matrix Factorization Library with Alternating Least Square (ALS)☆181Updated 7 years ago
- papers on scalable and efficient machine learning systems☆192Updated 7 years ago
- Scalable and Sustainable Deep Learning via Randomized Hashing☆94Updated 3 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆145Updated 6 years ago
- Deep learning system course☆215Updated 7 years ago
- DrMAD☆107Updated 8 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆44Updated 8 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆71Updated 8 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 9 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13Updated 10 years ago
- Cython implementation of k-MC2 and AFK-MC2 seeding☆207Updated 4 years ago
- A set of distributed learning algorithms for Torch☆93Updated 4 years ago
- Benchmarks for several RNN variations with different deep-learning frameworks☆171Updated 6 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 9 years ago
- Hyperparameter optimization with approximate gradient☆66Updated 4 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 9 years ago
- AI Final Project☆65Updated 9 years ago
- ☆76Updated 9 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆110Updated 8 years ago
- Source code for ``Neural Networks with Few Multiplications'' published at ICLR 2016☆80Updated 9 years ago
- A platform for distributed optimization expriments using OpenMPI☆21Updated 8 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆68Updated 9 years ago