tqchen / mshadow
Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
☆33 Updated 8 years ago
Alternatives and similar repositories for mshadow:
Users who are interested in mshadow are comparing it to the libraries listed below:
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM ☆49 Updated 7 years ago
- An MXNet multi-task tutorial ☆33 Updated 8 years ago
- C++ interface for MXNet ☆115 Updated 7 years ago
- Just-in-time Dynamic Batching with MXNet Gluon. ☆52 Updated 4 years ago
- Sublinear memory optimization for deep learning; reduces GPU memory cost to train deeper nets ☆28 Updated 8 years ago
- A C++ wrapper of Caffe and MXNet to make predictions ☆50 Updated 6 years ago
- (Spring 2017) Assignment 2: GPU Executor ☆63 Updated 7 years ago
- The code to learn MXNet ☆60 Updated 8 years ago
- Auto-tuning momentum SGD optimizer ☆23 Updated 7 years ago
- ☆30 Updated 7 years ago
- Batch Normalization Layer for Caffe ☆35 Updated 9 years ago
- BVLC/caffe helper tools for my own use ☆24 Updated 9 years ago
- Fully convolutional MultiBox detector (like SSD) implemented in Torch. ☆40 Updated 8 years ago
- MXNet Model Serving ☆25 Updated 7 years ago
- Asynchronous One Step Q Learning implemented with MXNet ☆20 Updated 8 years ago
- detection-developing ☆20 Updated 10 years ago
- An implementation of Highway Networks in Caffe ☆95 Updated 9 years ago
- mpi-caffe ☆49 Updated 5 years ago
- Faster R-CNN implemented in MXNet, with distributed and data-parallel support. ☆36 Updated 8 years ago
- A universal deep learning model converter ☆141 Updated 8 years ago
- Caffe with NNPACK integration ☆58 Updated 8 years ago
- Simple MXNet sequence-to-sequence model (neural machine translation) ☆24 Updated 6 years ago
- Flattened convolutional neural networks (1D convolution modules for Torch nn) ☆61 Updated 9 years ago
- MXNet implementation of Deep Q-learning ☆34 Updated 7 years ago
- DNN Inference with CPU, C++, ONNX support: Instant ☆56 Updated 6 years ago
- ☆57 Updated 6 years ago
- ☆37 Updated 9 years ago
- Fast binary matrix product on CPU ☆10 Updated 8 years ago
- ☆18 Updated 7 years ago
- Training deep neural networks with low precision multiplications ☆63 Updated 9 years ago