Phoenix8215 / A-White-Paper-on-Neural-Network-DeploymentLinks
模型部署白皮书(CUDA|ONNX|TensorRT|C++)🚀🚀🚀
☆231Updated last year
Alternatives and similar repositories for A-White-Paper-on-Neural-Network-Deployment
Users that are interested in A-White-Paper-on-Neural-Network-Deployment are comparing it to the libraries listed below
Sorting:
- Ai edge toolbox,专门面向边端设备尤其是嵌入式RTOS平台,AI模型部署工具链,包括模型推理引擎和模型压缩工具☆166Updated last year
- Deep Learning Deployment Framework: Supports tf/torch/trt/trtllm/vllm and other NN frameworks. Support dynamic batching, and streaming mo…☆168Updated 7 months ago
- Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+…☆160Updated this week
- A repo that uses TensorRT to deploy wll-trained models.Support RTDETR,YOLO-NAS,YOLOV5,YOLOV6,YOLOV7,YOLOV8,YOLOX.☆107Updated 2 years ago
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆158Updated 3 years ago
- TensorRT 2022 亚军方案,tensorrt加速mobilevit模型☆68Updated 3 years ago
- This repository give a guidline to learn CUDA and TensorRT from the beginning.☆294Updated 9 months ago
- ☆305Updated 3 years ago
- 介绍更加详细的剪枝蒸馏方法☆70Updated 7 months ago
- Build CUDA Neural Network From Scratch☆22Updated last year
- An ultra fast tiny model for lane detection, using onnx_parser, TensorRTAPI, torch2trt to accelerate. our model support for int8, dynamic…☆120Updated 4 years ago
- SegmentAnything-OnnxRunner is an example using Meta AI Research's SAM onnx model in C++.The encoder and decoder of SAM are decoupled in t…☆98Updated 2 years ago
- YOLOv5 pruning on COCO Dataset☆85Updated 2 years ago
- learning-cuda-trt☆118Updated 2 years ago
- 基于《cuda编程-基础与实践》(樊哲勇 著)的cuda学习之路。☆387Updated last year
- 高性能计算课程&CUDA编程实例&深度学习推理框架☆61Updated 2 years ago
- A light llama-like llm inference framework based on the triton kernel.☆166Updated 2 months ago
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。☆463Updated last month
- This is a Chinese translation of the CUDA programming guide☆1,782Updated last year
- 🚀 Do not need libtorch, pure C++ TensorRT deploys SOLOv2 etc, which can be quickly ported to NX/TX2.☆42Updated 3 years ago
- 目标检测,采用yolov8作为基准模型,数据集采用VisDrone2019,带有自己的改进策略☆113Updated last year
- ☆152Updated last year
- ☆302Updated last year
- PyTorch Quantization Aware Training(QAT,量化感知训练)☆42Updated 2 years ago
- 深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。☆503Updated 6 months ago
- 该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。☆70Updated 2 years ago
- 《CUDA编程基础与实践》一书的代码☆144Updated 3 years ago
- trt-hackathon-2022 三等奖方案☆10Updated 2 years ago
- QAT(quantize aware training) for classification with MQBench☆28Updated 4 years ago
- An onnx-based quantitation tool.☆71Updated last year