Yulv-git / Model-Inference-DeploymentLinks
A curated list of awesome inference deployment framework of artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlow Lite, TensorFlow Serving, ONNX Runtime, LibTorch, NCNN, TNN, MNN, TVM, MACE, Paddle Lite, MegEngine Lite, OpenPPL, Bolt, ExecuTorch.
☆70Updated last year
Alternatives and similar repositories for Model-Inference-Deployment
Users that are interested in Model-Inference-Deployment are comparing it to the libraries listed below
Sorting:
- A simple tool that can generate TensorRT plugin code quickly.☆237Updated 2 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆219Updated last year
- ☆120Updated 2 years ago
- An onnx-based quantitation tool.☆71Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆72Updated 5 months ago
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆157Updated 3 years ago
- YOLOv5 on Orin DLA☆215Updated last year
- ☆145Updated last year
- 该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。☆70Updated 2 years ago
- A simple tutorial of SNPE.☆178Updated 2 years ago
- TensorRT 7 C++ (almost) minimal examples☆83Updated 2 years ago
- TensorRT 2022 亚军方案,tensorrt加速mobilevit模型☆68Updated 3 years ago
- learning-cuda-trt☆118Updated 2 years ago
- ☆79Updated 2 years ago
- This is 8-bit quantization sample for yolov5. Both PTQ, QAT and Partial Quantization have been implemented, and present the results based…☆109Updated 3 years ago
- PyTorch Quantization Aware Training Example☆144Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆41Updated 2 years ago
- ☆149Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆162Updated 3 weeks ago
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆355Updated 3 years ago
- Offline Quantization Tools for Deploy.☆141Updated last year
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Updated 2 years ago
- NVIDIA-阿里2021 TRT比赛 `二等奖` 代码提交 团队:美迪康 AI Lab☆174Updated 3 years ago
- Simple demo of tensorrt plugin☆44Updated 4 years ago
- Using pattern matcher in onnx model to match and replace subgraphs.☆81Updated last year
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆129Updated 6 months ago
- Serving Inside Pytorch☆165Updated last week
- Useful tensorrt plugin. For pytorch and mmdetection model conversion.☆165Updated last year
- TensorRT 2022复赛方案: 首个基于Transformer的图像重建模型MST++的TensorRT模型推断优化☆143Updated 3 years ago
- a simple pipline of int8 quantization based on tensorrt.☆69Updated 3 years ago