kentaroy47 / benchmark-FP32-FP16-INT8-with-TensorRTLinks
Benchmark inference speed of CNNs with various quantization methods in Pytorch+TensorRT with Jetson Nano/Xavier
☆56Updated 2 years ago
Alternatives and similar repositories for benchmark-FP32-FP16-INT8-with-TensorRT
Users that are interested in benchmark-FP32-FP16-INT8-with-TensorRT are comparing it to the libraries listed below
Sorting:
- PyTorch Static Quantization Example☆38Updated 4 years ago
- ☆79Updated 4 years ago
- ☆52Updated 4 years ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆95Updated last year
- convert torch module to tensorrt network or tvm function☆89Updated 5 years ago
- PyTorch reimplementation of RegNet (Design Space Design, CVPR2020) on CIFAR10 and ImageNet☆47Updated 5 years ago
- Fast NPU-aware Neural Architecture Search☆22Updated 4 years ago
- ☆60Updated 5 years ago
- ☆25Updated 3 years ago
- Convert MobileNetV3Small defined and pre-trained in PyTorch to a TFLite quantized model☆76Updated 2 years ago
- ☆48Updated 5 years ago
- Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'☆100Updated 2 years ago
- Inference of quantization aware trained networks using TensorRT☆83Updated 2 years ago
- Implementation of darknet19 in PyTorch with imagenet pretrained☆18Updated 6 years ago
- Pytorch implementation of EfficientNet-lite. ImageNet pre-trained models are provided.☆116Updated 4 years ago
- PyTorch Quantization Aware Training Example☆146Updated last year
- Parallel CUDA implementation of NON maximum Suppression☆81Updated 5 years ago
- Refactored implementation of CenterNet (Objects as Points - Zhou, Xingyi et. al.) shipping with PyTorch Lightning modules☆60Updated 2 years ago
- ONNX converter and optimizer scirpts for Kneron hardware.☆40Updated 2 years ago
- quantize aware training package for NCNN on pytorch☆69Updated 4 years ago
- Apply the pruning strategy for MobileNet_v2☆52Updated 6 years ago
- Class Project for 18663 - Implementation of FBNet (Hardware-Aware DNAS)☆34Updated 6 years ago
- yolov3 model compress and acceleration (quantization, sparse), c++ version☆37Updated 5 years ago
- ☆124Updated 4 years ago
- Using ideas from product quantization for state-of-the-art neural network compression.☆146Updated 4 years ago
- Object detection achieving 44.3 mAP / 45 fps on COCO dataset☆169Updated 5 years ago
- PyTorch re-implementation of YOLOv4 architecture☆46Updated 5 years ago
- This repo contains the official Pytorch reimplementation of the paper "NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Appl…☆187Updated 2 years ago
- A PyTorch implementation of SSDLite on COCO☆89Updated 5 years ago
- Mish Activation Function for PyTorch☆51Updated 4 years ago