lix19937 / tensorrt-insight
Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda
☆17Updated this week
Alternatives and similar repositories for tensorrt-insight
Users that are interested in tensorrt-insight are comparing it to the libraries listed below
Sorting:
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆57Updated 11 months ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Updated last year
- learn TensorRT from scratch🥰☆14Updated 7 months ago
- A tool convert TensorRT engine/plan to a fake onnx☆39Updated 2 years ago
- ☆13Updated 2 years ago
- 本仓库在OpenVINO推理框架下部署Nanodet检测算法,并重写预处理和后处理部分,具有超高性能!让你在Intel CPU平台上的检测速度起飞! 并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度,推理速度更快!☆15Updated last year
- ☆16Updated last year
- PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset☆18Updated 2 years ago
- This is a repository to practice multi-thread programming in C++☆24Updated last year
- For 2022 Nvidia Hackathon☆21Updated 2 years ago
- C++ TensorRT Implementation of NanoSAM☆38Updated last year
- ☆17Updated 4 years ago
- The real-time Instance Segmentation Algorithm Yolov7 running on TensoRT and ONNX☆23Updated 2 years ago
- YOLOv5 on Orin DLA☆201Updated last year
- Common libraries for PPL projects☆29Updated 2 months ago
- ☆23Updated 2 years ago
- snpe tutorial☆10Updated last year
- A fork of the BEVDet series .☆20Updated last year
- A ROS 1/ROS 2 hybrid package wrapping the Apache TVM project.☆10Updated 2 years ago
- ☆24Updated 2 years ago
- A simple neural network inference framework☆25Updated last year
- BevDet_TensorRT☆5Updated last year
- original trained model(float) for Horizon model convert☆53Updated last year
- Collection of blogs on AI development☆19Updated 6 months ago
- OpenVINO™ optimization for PointPillars*☆32Updated last week
- A unified and extensible pipeline for deep learning model inference with C++. Now support yolov8, yolov9, clip, and nanosam. More models …☆12Updated last year
- TensorRT depth-anything for anyone and anywhere☆14Updated last year
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆30Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆12Updated 7 months ago
- CUDA 6大并行计算模式 代码与笔记☆61Updated 4 years ago