raymond-li / tflite_tensor_outputter
Generates intermediate tensor outputs for tflite
☆15Updated 5 years ago
Alternatives and similar repositories for tflite_tensor_outputter:
Users that are interested in tflite_tensor_outputter are comparing it to the libraries listed below
- Explained QNNPACK Implementation☆20Updated 5 years ago
- Tutorials on Quantized Neural Network using Tensorflow Lite☆86Updated 5 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆98Updated last month
- ☆11Updated 5 years ago
- Tensorflow quantization (float32-->int8) inference test☆74Updated 6 years ago
- ONNX converter and optimizer scirpts for Kneron hardware.☆38Updated last year
- ☆67Updated 5 years ago
- Simple pruning example using Caffe☆33Updated 7 years ago
- fixed-point, symmetric, power-of-2 quantization-aware training in tensorflow 1.13.1☆12Updated 5 years ago
- Repository containing pruned models and related information☆37Updated 4 years ago
- Quantized Neural Networks - networks trained for inference at arbitrary low precision.☆146Updated 7 years ago
- Implementation of "DeepShift: Towards Multiplication-Less Neural Networks" https://arxiv.org/abs/1905.13298☆108Updated 3 years ago
- PyTorch Static Quantization Example☆38Updated 3 years ago
- Cheng-Hao Tu, Jia-Hong Lee, Yi-Ming Chan and Chu-Song Chen, "Pruning Depthwise Separable Convolutions for MobileNet Compression," Interna…☆18Updated 4 years ago
- ☆62Updated 7 years ago
- DL quantization for pytorch☆26Updated 5 years ago
- Optimizing Mobile Deep Learning on ARM GPU with TVM☆180Updated 6 years ago
- PyTorch -> ONNX -> TVM for autotuning☆23Updated 5 years ago
- I'm going to use the Winograd’s minimal filtering algorithms to introduce a new class of fast algorithms for convolutional neural networks…☆12Updated 7 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆168Updated 5 years ago
- ☆28Updated last year
- Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform.☆54Updated 7 years ago
- A quick view of high-performance convolution neural networks (CNNs) inference engines on mobile devices.☆150Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated last year
- Benchmark of TVM quantized model on CUDA☆111Updated 4 years ago
- ☆58Updated 3 years ago
- Inference of quantization aware trained networks using TensorRT☆80Updated 2 years ago
- Simulate quantization and quantization aware training for MXNet-Gluon models.☆46Updated 4 years ago
- Implementation of convolution layer in different flavors☆68Updated 7 years ago
- Hopefully fast implementation of XNOR-Net in C, because, why not?☆26Updated 7 years ago