baidu-research / nervanagpuLinks
Nervana GPU library
☆49Updated 10 years ago
Alternatives and similar repositories for nervanagpu
Users that are interested in nervanagpu are comparing it to the libraries listed below
Sorting:
- Torch7: state-of-the-art machine learning algorithms☆224Updated 11 years ago
- TH++, C++ interface to the torch7 TH library☆238Updated 6 years ago
- Computation using data flow graphs for scalable machine learning☆35Updated 8 years ago
- C++ 11 implementation of Geoff Hinton's Deep Learning matlab code☆283Updated 9 years ago
- An efficient character based RNN☆91Updated 6 years ago
- Facebook's extensions to torch/torch7. This is a preliminary release.☆36Updated 8 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆194Updated 7 years ago
- Convolution op for Theano based on CuFFT using scikits.cuda☆52Updated 10 years ago
- Unsupervised Learning on Neural Network Outputs☆73Updated 8 years ago
- (Spring 2017) Assignment 2: GPU Executor☆62Updated 8 years ago
- LASSO is a parallel regression model learning system☆69Updated 11 years ago
- Benchmarks for CNTK and other toolkits.☆44Updated 9 years ago
- The Deep Learning training framework on Spark☆220Updated last month
- Logistic regression engine for medium-sized data☆55Updated 10 years ago
- A GPU implementation of Convolutional Neural Nets in C++☆506Updated 4 years ago
- Distributed word embedding☆137Updated 8 years ago
- Facebook's extensions to torch/nn.☆283Updated 8 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆178Updated 6 years ago
- ☆154Updated 8 years ago
- ☆73Updated 13 years ago
- Lightweight MapReduce in python☆151Updated 11 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture☆134Updated 8 years ago
- ☆76Updated 8 years ago
- Distributed skipgram mixture model for multisense word embedding☆115Updated 9 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13Updated 10 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 8 years ago
- Intel® Deep Learning Framework☆313Updated 9 years ago
- Facebook's CUDA extensions.☆283Updated 6 years ago
- AI Final Project☆65Updated 9 years ago
- Implementing (parts of) TensorFlow (almost) from Scratch☆30Updated 7 years ago