vinx13 / tvm-cuda-int8-benchmarkLinks
Benchmark of TVM quantized model on CUDA
☆111Updated 5 years ago
Alternatives and similar repositories for tvm-cuda-int8-benchmark
Users that are interested in tvm-cuda-int8-benchmark are comparing it to the libraries listed below
Sorting:
- Optimizing Mobile Deep Learning on ARM GPU with TVM☆181Updated 7 years ago
- Simulate quantization and quantization aware training for MXNet-Gluon models.☆45Updated 5 years ago
- TVM tutorial☆66Updated 6 years ago
- This is a CNN Analyzer tool, based on Netscope by dgschwend/netscope☆42Updated 7 years ago
- A quick view of high-performance convolution neural networks (CNNs) inference engines on mobile devices.☆151Updated 3 years ago
- Tengine gemm tutorial, step by step☆13Updated 4 years ago
- ☆66Updated 6 years ago
- Fast CUDA Kernels for ResNet Inference.☆181Updated 6 years ago
- TFLite python API package for parsing TFLite model☆12Updated 5 years ago
- tophub autotvm log collections☆69Updated 2 years ago
- convert torch module to tensorrt network or tvm function☆89Updated 5 years ago
- Simple Training and Deployment of Fast End-to-End Binary Networks☆158Updated 3 years ago
- TVM learning and research☆13Updated 4 years ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆193Updated 6 years ago
- Parallel CUDA implementation of NON maximum Suppression☆80Updated 5 years ago
- ☆57Updated 4 years ago
- TensorFlow and TVM integration☆36Updated 5 years ago
- A Computation Graph Virtual Machine based ML Framework☆108Updated last year
- Added quantization layer into caffe (support a coarse level fixed point simulation)☆21Updated 8 years ago
- ☆10Updated 5 years ago
- Caffe Computation Graph Optimization.☆29Updated 5 years ago
- ☆26Updated 8 years ago
- ☆42Updated 5 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 5 years ago
- Caffe implementation of accurate low-precision neural networks☆118Updated 7 years ago
- Qualcomm Hexagon NN Offload Framework☆43Updated 5 years ago
- ☆36Updated 3 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated 3 months ago
- Caffe implementation of ICCV 2017 & TPAMI 2018 paper - ThiNet☆46Updated 7 years ago