Yulv-git / Model-Inference-DeploymentLinks

A curated list of awesome inference deployment framework of artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlow Lite, TensorFlow Serving, ONNX Runtime, LibTorch, NCNN, TNN, MNN, TVM, MACE, Paddle Lite, MegEngine Lite, OpenPPL, Bolt, ExecuTorch.

☆65

Alternatives and similar repositories for Model-Inference-Deployment

Users that are interested in Model-Inference-Deployment are comparing it to the libraries listed below

Sorting:

Oldpan / DeployIsAllYouNeed
☆121Updated 2 years ago
torchpipe / torchpipe
Serving Inside Pytorch
☆163Updated last week
NVIDIA-AI-IOT / tensorrt_plugin_generator
A simple tool that can generate TensorRT plugin code quickly.
☆231Updated 2 years ago
jinmin527 / learning-cuda-trt
A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt
☆139Updated 3 years ago
NVIDIA / Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
☆209Updated last year
tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆157Updated last year
shouxieai / tensorRT_quantization
该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。
☆69Updated last year
chenlamei / MobileVit_TensorRT
TensorRT 2022 亚军方案，tensorrt加速mobilevit模型
☆68Updated 3 years ago
leimao / TensorRT-Custom-Plugin-Example
Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration
☆66Updated 2 months ago
thb1314 / mmyolo_tensorrt
☆147Updated last year
NVIDIA-AI-IOT / cuDLA-samples
YOLOv5 on Orin DLA
☆207Updated last year
gesanqiu / SNPE_Tutorial
A simple tutorial of SNPE.
☆177Updated 2 years ago
TRT2022 / trtllm-llama
☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化
☆50Updated last year
BBuf / onnx_learn
☆99Updated 4 years ago
maggiez0138 / yolov5_quant_sample
This is 8-bit quantization sample for yolov5. Both PTQ, QAT and Partial Quantization have been implemented, and present the results based…
☆105Updated 3 years ago
sesmfs / onnx_quant_tool
An onnx-based quantitation tool.
☆71Updated last year
Ascend / samples
☆130Updated last year
agrechnev / trt-cpp-min
TensorRT 7 C++ (almost) minimal examples
☆83Updated last year
leimao / PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
☆138Updated last year
FeiGeChuanShu / trt2023
NVIDIA TensorRT Hackathon 2023复赛选题：通义千问Qwen-7B用TensorRT-LLM模型搭建及优化
☆42Updated last year
thb1314 / maskrcnn-tensorrt
☆47Updated 2 years ago
sesmfs / onnx_matcher
Using pattern matcher in onnx model to match and replace subgraphs.
☆81Updated last year
cvdong / YOLO_TRT_SIM
高效部署：YOLO X, V3, V4, V5, V6, V7, V8, EdgeYOLO TRT推理 ™️ ,前后处理均由CUDA核函数实现 CPP/CUDA🚀
☆49Updated 2 years ago
TRT2022 / MST-plus-plus-TensorRT
TensorRT 2022复赛方案：首个基于Transformer的图像重建模型MST++的TensorRT模型推断优化
☆140Updated 3 years ago
Tlntin / trt2023
☆26Updated last year
DataXujing / TensorRT-DETR
NVIDIA-阿里2021 TRT比赛 `二等奖` 代码提交团队：美迪康 AI Lab
☆171Updated 3 years ago
shouxieai / learning-cuda-trt
learning-cuda-trt
☆114Updated 2 years ago
inisis / OnnxSlim
A Toolkit to Help Optimize Onnx Model
☆189Updated this week
yhwang-hub / dl_model_deploy
☆78Updated 2 years ago
HeKun-NVIDIA / TensorRT-Developer_Guide_in_Chinese
☆293Updated 3 years ago