merrymercy / tvm-mali
Optimizing Mobile Deep Learning on ARM GPU with TVM
☆179Updated 5 years ago
Related projects:
- Heterogeneous Run Time version of MXNet. Added heterogeneous capabilities to the MXNet, uses heterogeneous computing infrastructure frame…☆72Updated 6 years ago
- Benchmark of TVM quantized model on CUDA☆112Updated 4 years ago
- A quick view of high-performance convolutional neural network (CNN) inference engines on mobile devices.☆150Updated 2 years ago
- Tengine gemm tutorial, step by step☆11Updated 3 years ago
- This is a CNN Analyzer tool, based on Netscope by dgschwend/netscope☆40Updated 6 years ago
- ☆76Updated this week
- Simulate quantization and quantization aware training for MXNet-Gluon models.☆46Updated 4 years ago
- Parallel CUDA implementation of non-maximum suppression☆77Updated 4 years ago
- Heterogeneous Run Time version of Caffe. Added heterogeneous capabilities to the Caffe, uses heterogeneous computing infrastructure frame…☆269Updated 5 years ago
- Adds a quantization layer to Caffe (supports coarse-level fixed-point simulation)☆22Updated 7 years ago
- Benchmark of ncnn, a high-performance neural network inference framework optimized for mobile platforms☆72Updated 5 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆53Updated last year
- Simple pruning example using Caffe☆33Updated 6 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆190Updated 5 years ago
- Caffe implementation of accurate low-precision neural networks☆118Updated 5 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated last year
- Ristretto: Caffe-based approximation of convolutional neural networks.☆30Updated 5 years ago
- TVM tutorial☆65Updated 5 years ago
- This code is an implementation of a trained YOLO neural network used with the TensorRT framework.☆88Updated 7 years ago
- tophub autotvm log collections☆70Updated last year
- Caffe for Sparse Convolutional Neural Network☆238Updated last year
- A pyCaffe implementation of the ICLR 2017 publication "Pruning Filters for Efficient ConvNets"☆43Updated 6 years ago
- This repository has moved. The new link can be obtained from https://github.com/TexasInstruments/jacinto-ai-devkit☆116Updated 4 years ago
- Benchmark for embedded-AI deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite, etc.☆202Updated 3 years ago
- A script to convert floating-point CNN models into generalized low-precision ShiftCNN representation☆55Updated 7 years ago
- CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms☆88Updated 6 years ago
- ☆31Updated 6 years ago
- Hopefully fast implementation of XNOR-Net in C, because, why not?☆26Updated 7 years ago
- Binary Weight Network and XNOR Network.☆63Updated 8 years ago
- Ristretto: Caffe-based approximation of convolutional neural networks.☆292Updated 5 years ago