Yulv-git / Model-Inference-DeploymentLinks
A curated list of awesome inference deployment framework of artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlow Lite, TensorFlow Serving, ONNX Runtime, LibTorch, NCNN, TNN, MNN, TVM, MACE, Paddle Lite, MegEngine Lite, OpenPPL, Bolt, ExecuTorch.
☆71Updated last year
Alternatives and similar repositories for Model-Inference-Deployment
Users that are interested in Model-Inference-Deployment are comparing it to the libraries listed below
Sorting:
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆170Updated 3 years ago
- ☆120Updated 2 years ago
- 该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。☆71Updated 2 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆225Updated last year
- An onnx-based quantitation tool.☆71Updated 2 years ago
- Serving Inside Pytorch☆170Updated 2 weeks ago
- A simple tool that can generate TensorRT plugin code quickly.☆239Updated 2 years ago
- learning-cuda-trt☆119Updated 2 years ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆79Updated 8 months ago
- ☆149Updated 2 years ago
- YOLOv5 on Orin DLA☆221Updated last year
- ☆47Updated 2 years ago
- TensorRT 7 C++ (almost) minimal examples☆84Updated 2 years ago
- High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI☆227Updated last month
- 高效部署:YOLO X, V3, V4, V5, V6, V7, V8, EdgeYOLO TRT推理 ™️ ,前后处理均由CUDA核函数实现 CPP/CUDA🚀☆53Updated 2 years ago
- Using pattern matcher in onnx model to match and replace subgraphs.☆82Updated last year
- ☆79Updated 2 years ago
- TensorRT 2022 亚军方案,tensorrt加速mobilevit模型☆68Updated 3 years ago
- Edgeai TIDL Tools and Examples - This repository contains Tools and example developed for Deep learning runtime (DLRT) offering provided …☆179Updated last week
- Speed up image preprocess with cuda when handle image or tensorrt inference☆85Updated 3 months ago
- Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"☆99Updated 4 months ago
- YOLOv5 Quantization Aware Training (QAT, qat_torch branch) and Post Training Quantization with ONNX (ptq_onnx branch ptq_onnx.ipynb)☆15Updated 2 years ago
- This is a repository to practice multi-thread programming in C++☆27Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆163Updated 3 months ago
- Sample projects for TensorRT in C++☆199Updated 2 years ago
- snpe tutorial☆10Updated 2 years ago
- ☆113Updated last year
- yolov5 tensorrt int8量化方法汇总☆86Updated 2 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆42Updated 3 years ago
- For 2022 Nvidia Hackathon☆22Updated 3 years ago