Yulv-git / Model-Inference-Deployment
A curated list of awesome inference deployment framework of artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlow Lite, TensorFlow Serving, ONNX Runtime, LibTorch, NCNN, TNN, MNN, TVM, MACE, Paddle Lite, MegEngine Lite, OpenPPL, Bolt, ExecuTorch.
☆59Updated 11 months ago
Alternatives and similar repositories for Model-Inference-Deployment:
Users that are interested in Model-Inference-Deployment are comparing it to the libraries listed below
- ☆120Updated last year
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆193Updated 10 months ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆46Updated last year
- Offline Quantization Tools for Deploy.☆127Updated last year
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- A light llama-like llm inference framework based on the triton kernel.☆108Updated last week
- An onnx-based quantitation tool.☆71Updated last year
- Serving Inside Pytorch☆160Updated this week
- For 2022 Nvidia Hackathon☆20Updated 2 years ago
- A simple tool that can generate TensorRT plugin code quickly.☆230Updated last year
- b站上的课程☆74Updated last year
- Tencent NCNN with added CUDA support☆69Updated 4 years ago
- Based of paper "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"☆63Updated 4 years ago
- This is 8-bit quantization sample for yolov5. Both PTQ, QAT and Partial Quantization have been implemented, and present the results based…☆102Updated 2 years ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆57Updated 10 months ago
- A tool convert TensorRT engine/plan to a fake onnx☆38Updated 2 years ago
- Collection of blogs on AI development☆19Updated 5 months ago
- ☆99Updated 3 years ago
- MegEngine到其他框架的转换器☆69Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆153Updated 11 months ago
- This is a repository to practice multi-thread programming in C++☆24Updated last year
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆49Updated last year
- arm-neon☆90Updated 8 months ago
- YOLOv5 on Orin DLA☆198Updated last year
- www.giantpandacv.com☆148Updated 10 months ago
- Simple demo of tensorrt plugin☆44Updated 3 years ago
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆131Updated 2 years ago
- nanodet int8 量化,实测推理2ms一帧!☆37Updated 4 years ago
- 车道线检测Lanenet TensorRT加速C++实现☆21Updated 3 years ago
- ☆80Updated 4 years ago