Phoenix8215 / A-White-Paper-on-Neural-Network-DeploymentLinks

模型部署白皮书(CUDA|ONNX|TensorRT|C++)🚀🚀🚀

☆212

Alternatives and similar repositories for A-White-Paper-on-Neural-Network-Deployment

Users that are interested in A-White-Paper-on-Neural-Network-Deployment are comparing it to the libraries listed below

Sorting:

Li-Hongda / TensorRT_Inference_Demo
A repo that uses TensorRT to deploy wll-trained models.Support RTDETR,YOLO-NAS,YOLOV5,YOLOV6,YOLOV7,YOLOV8,YOLOX.
☆107Updated last year
NetEase-Media / grps
Deep Learning Deployment Framework: Supports tf/torch/trt/trtllm/vllm and other NN frameworks. Support dynamic batching, and streaming mo…
☆164Updated 2 months ago
midea-ai / Aidget
Ai edge toolbox，专门面向边端设备尤其是嵌入式RTOS平台，AI模型部署工具链，包括模型推理引擎和模型压缩工具
☆157Updated last year
NetEase-Media / grps_trtllm
Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+…
☆145Updated 2 months ago
kalfazed / tensorrt_starter
This repository give a guidline to learn CUDA and TensorRT from the beginning.
☆251Updated 5 months ago
HeKun-NVIDIA / TensorRT-Developer_Guide_in_Chinese
☆291Updated 3 years ago
OroChippw / SegmentAnything-OnnxRunner
SegmentAnything-OnnxRunner is an example using Meta AI Research's SAM onnx model in C++.The encoder and decoder of SAM are decoupled in t…
☆98Updated last year
Phoenix8215 / BuildCudaNeuralNetworkFromScratch
Build CUDA Neural Network From Scratch
☆21Updated 10 months ago
YZY-stack / Ultra_Fast_Lane_Detection_TensorRT
An ultra fast tiny model for lane detection, using onnx_parser, TensorRTAPI, torch2trt to accelerate. our model support for int8, dynamic…
☆120Updated 4 years ago
HeKun-NVIDIA / CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
☆1,598Updated 8 months ago
harleyszhang / lite_llama
A light llama-like llm inference framework based on the triton kernel.
☆134Updated last week
YukSing12 / anchordetr_tensorrt
trt-hackathon-2022 三等奖方案
☆10Updated 2 years ago
caixiongjiang / HPC
高性能计算课程&CUDA编程实例&深度学习推理框架
☆50Updated last year
QINZHAOYU / CudaSteps
基于《cuda编程-基础与实践》（樊哲勇著）的cuda学习之路。
☆329Updated last year
shouxieai / learning-cuda-trt
learning-cuda-trt
☆113Updated 2 years ago
jinmin527 / learning-cuda-trt
A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt
☆136Updated 2 years ago
uyzhang / yolov5_prune
YOLOv5 pruning on COCO Dataset
☆83Updated 2 years ago
zjhellofss / kuiperdatawhale
☆279Updated 9 months ago
zjhellofss / KuiperLLama
校招、秋招、春招、实习好项目，带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。
☆383Updated 2 weeks ago
Broad-sky / common-image-segmentation-algorithm
🚀 Do not need libtorch, pure C++ TensorRT deploys SOLOv2 etc, which can be quickly ported to NX/TX2.
☆42Updated 2 years ago
shouxieai / tensorRT_quantization
该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。
☆69Updated last year
RussWong / CUDATutorial
A CUDA tutorial to make people learn CUDA program from 0
☆239Updated last year
chenlamei / MobileVit_TensorRT
TensorRT 2022 亚军方案，tensorrt加速mobilevit模型
☆68Updated 3 years ago
Tongkaio / CUDA_Kernel_Samples
CUDA 算子手撕与面试指南
☆471Updated 6 months ago
emptysoal / tensorrt-experiment
Base on tensorrt version 8.2.4, compare inference speed for different tensorrt api.
☆48Updated 3 weeks ago
chaizwj / yolov8-tricks
目标检测，采用yolov8作为基准模型，数据集采用VisDrone2019，带有自己的改进策略
☆97Updated last year
yhwang-hub / dl_model_deploy
☆78Updated 2 years ago
zjhellofss / triton_course
☆31Updated 2 months ago
thb1314 / mmyolo_tensorrt
☆142Updated last year
gottingen / kumo-search
docs for search system and ai infra
☆218Updated last year