csarron / MobiRnn
Efficient LSTM parallelization on smartphone GPU
☆21Updated 7 years ago
Alternatives and similar repositories for MobiRnn:
Users that are interested in MobiRnn are comparing it to the libraries listed below
- MobiRNN code for the 1st International Workshop on Embedded and Mobile Deep Learning☆8Updated 7 years ago
- SqueezeNet Generator☆31Updated 6 years ago
- The quantization of CNN/LSTM☆11Updated 8 years ago
- Training neural networks with 8-bit computations☆28Updated 9 years ago
- auto-tuning momentum SGD optimizer☆23Updated 7 years ago
- ☆23Updated 8 years ago
- An Example of MXNet Models Comilation and Deployment with NNVM in C++☆16Updated 6 years ago
- Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM☆49Updated 7 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- ☆17Updated 4 years ago
- ☆57Updated 6 years ago
- ☆19Updated last year
- ☆13Updated 8 years ago
- Some deep learning models written with mxnet and C++11.☆13Updated 7 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 6 years ago
- ☆18Updated 7 years ago
- Caffe re-implementation of dynamic network surgery.☆18Updated 6 years ago
- Move to https://github.com/apache/incubator-tvm-site☆26Updated 4 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 6 years ago
- A script to convert floating-point CNN models into generalized low-precision ShiftCNN representation☆56Updated 7 years ago
- a model zoo☆11Updated 7 years ago
- Training Low-bits DNNs with Stochastic Quantization☆73Updated 7 years ago
- Simple pruning example using Caffe☆33Updated 7 years ago
- Simple MXNet sequence-to-sequence model (neural machine translation)☆24Updated 7 years ago
- VGG16 architecture with BatchNorm☆14Updated 8 years ago
- Faster Deep Neural Networks☆36Updated 7 years ago
- Binarized Neural Network☆9Updated 8 years ago
- Portal of Johannes and Felix's RNN implementation and further modifications for ASR☆21Updated 10 years ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆41Updated 6 years ago
- ☆15Updated 7 years ago