amplab / cycladesLinks
Cyclades
☆28Updated 7 years ago
Alternatives and similar repositories for cyclades
Users that are interested in cyclades are comparing it to the libraries listed below
Sorting:
- GPU-specialized parameter server for GPU machine learning.☆101Updated 7 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆194Updated 7 years ago
- Scalable and Sustainable Deep Learning via Randomized Hashing☆93Updated 3 years ago
- Papers and blogs related to distributed deep learning☆96Updated 7 years ago
- cache-friendly multithread matrix factorization☆89Updated 9 years ago
- (Spring 2017) Assignment 2: GPU Executor☆62Updated 8 years ago
- FRED simulator and associated paper☆26Updated 9 years ago
- MPI for Torch☆61Updated 8 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13Updated 10 years ago
- ☆19Updated 7 years ago
- papers on scalable and efficient machine learning systems☆192Updated 6 years ago
- CUDA Matrix Factorization Library with Alternating Least Square (ALS)☆180Updated 6 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆44Updated 8 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools☆170Updated 7 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 8 years ago
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆72Updated 7 years ago
- communication-efficient distributed coordinate ascent☆91Updated 6 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 9 years ago
- Parallelizing word2vec in shared and distributed memory☆190Updated 2 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 8 years ago
- ☆76Updated 9 years ago
- Omnivore Optimizer and Distributed CcT☆13Updated 9 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆110Updated 6 years ago
- Cython implementation of k-MC2 and AFK-MC2 seeding☆204Updated 3 years ago
- Multi-armed bandit simulation library☆139Updated last year
- Benchmarks for CNTK and other toolkits.☆44Updated 9 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 8 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆111Updated 8 years ago
- A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783☆67Updated 8 years ago
- Random Walk (Personalized PageRank) Algorithms for Large Graphs☆73Updated 9 years ago