kentaroy47 / benchmark-FP32-FP16-INT8-with-TensorRT
Benchmark inference speed of CNNs with various quantization methods in PyTorch + TensorRT on Jetson Nano/Xavier
☆56 Updated 2 years ago
Alternatives and similar repositories for benchmark-FP32-FP16-INT8-with-TensorRT
Users who are interested in benchmark-FP32-FP16-INT8-with-TensorRT are comparing it to the libraries listed below.
- ONNX converter and optimizer scripts for Kneron hardware. ☆40 Updated last year
- Count number of parameters / MACs / FLOPS for ONNX models. ☆93 Updated 8 months ago
- ☆79 Updated 4 years ago
- ☆52 Updated 4 years ago
- Quantization-aware training package for NCNN on PyTorch ☆69 Updated 3 years ago
- Convert a torch module to a TensorRT network or TVM function ☆89 Updated 5 years ago
- ☆60 Updated 5 years ago
- Fast NPU-aware Neural Architecture Search ☆22 Updated 3 years ago
- Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs' ☆98 Updated last year
- Refactored implementation of CenterNet (Objects as Points - Zhou, Xingyi et al.) shipping with PyTorch Lightning modules ☆60 Updated 2 years ago
- Python scripts for performing lane detection using the LSTR model in ONNX ☆33 Updated 3 years ago
- PyTorch Static Quantization Example ☆38 Updated 4 years ago
- PyTorch reimplementation of RegNet (Design Space Design, CVPR2020) on CIFAR10 and ImageNet ☆48 Updated 5 years ago
- ☆25 Updated 2 years ago
- PyTorch re-implementation of YOLOv4 architecture ☆45 Updated 5 years ago
- Convert MobileNetV3Small defined and pre-trained in PyTorch to a TFLite quantized model ☆74 Updated 2 years ago
- TensorRT plugin for DCNv2 layer in ONNX model ☆60 Updated 4 years ago
- Pilgrim Project: torch2trt, quickly convert your PyTorch model to a TensorRT engine. ☆19 Updated 4 years ago
- A package to make Network Slimming a little easier ☆48 Updated 3 years ago
- YOLOv3 (+tiny) pythonic PyTorch implementation. ☆34 Updated 6 years ago
- Inference of quantization aware trained networks using TensorRT ☆83 Updated 2 years ago
- PyTorch implementation of EfficientNet Lite variants ☆15 Updated 2 years ago
- Parallel CUDA implementation of non-maximum suppression ☆79 Updated 4 years ago
- PyTorch Quantization Aware Training Example ☆137 Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan … ☆35 Updated 3 years ago
- A PyTorch implementation of EfficientDet ☆31 Updated 5 years ago
- An SDK showing how to use an OpenVINO model converted from YOLOv5 ☆36 Updated 4 years ago
- YOLOv3 model compression and acceleration (quantization, sparsity), C++ version ☆37 Updated 5 years ago
- This repository provides a YOLOv5 GPU optimization sample ☆106 Updated 2 years ago
- TensorRT implementation of "RepVGG: Making VGG-style ConvNets Great Again" ☆76 Updated 4 years ago