tqchen / mshadow
Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
☆33 Updated 8 years ago
Alternatives and similar repositories for mshadow:
Users who are interested in mshadow are comparing it to the libraries listed below:
- (Spring 2017) Assignment 2: GPU Executor ☆62 Updated 7 years ago
- A C++ wrapper of Caffe and MXNet to make predictions ☆49 Updated 7 years ago
- C++ interface for MXNet ☆115 Updated 8 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM ☆49 Updated 7 years ago
- ☆30 Updated 7 years ago
- Simple MXNet sequence-to-sequence model (neural machine translation) ☆24 Updated 7 years ago
- MXNet Model Serving ☆25 Updated 7 years ago
- Just-in-time Dynamic Batching with MXNet Gluon. ☆52 Updated 4 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets ☆28 Updated 8 years ago
- Batch Normalization Layer for Caffe ☆35 Updated 9 years ago
- An MXNet multi-task tutorial ☆33 Updated 8 years ago
- Coding example of DLIF tutorial ☆66 Updated 8 years ago
- A universal deep learning model converter ☆141 Updated 8 years ago
- Flattened convolutional neural networks (1D convolution modules for Torch nn) ☆61 Updated 9 years ago
- Full convolution MultiBox Detector (like SSD) implemented in Torch. ☆40 Updated 8 years ago
- Gated CNN (Language Modeling with Gated Convolutional Networks) ☆18 Updated 8 years ago
- An implementation of Highway Networks in Caffe ☆95 Updated 9 years ago
- The code to learn MXNet ☆60 Updated 8 years ago
- OpenCL Inference Engine for PyTorch ☆51 Updated 7 years ago
- Faster R-CNN, an MXNet implementation with distributed implementation and data parallelization. ☆36 Updated 8 years ago
- MPI Parallel framework for training deep learning models built in Theano ☆53 Updated 7 years ago
- DNN Inference with CPU, C++, ONNX support: Instant ☆56 Updated 6 years ago
- GPU/CPU (CUDA) Implementation of "Recurrent Memory Array Structures", Simple RNN, LSTM, Array LSTM.. ☆25 Updated 5 years ago
- Asynchronous One Step Q Learning implemented with MXNet ☆20 Updated 8 years ago
- An OCR system based on Torch using LSTM/GRU-RNN and CTC, with reference to the works of rnnlib and clstm. ☆66 Updated 9 years ago
- Distributed LDA, takes raw text as input and outputs topic word table. ☆16 Updated 8 years ago
- mpi-caffe ☆49 Updated 5 years ago
- TensorFlow Input Pipeline Examples based on multi-thread and FIFOQueue ☆52 Updated 7 years ago
- BVLC/caffe helper tools for my own use ☆24 Updated 9 years ago
- MXNet implementation of Deep Q-learning ☆34 Updated 7 years ago