Phoenix8215 / A-White-Paper-on-Neural-Network-Deployment
模型部署白皮书(CUDA|ONNX|TensorRT|C++)🚀🚀🚀
☆188Updated 6 months ago
Alternatives and similar repositories for A-White-Paper-on-Neural-Network-Deployment:
Users that are interested in A-White-Paper-on-Neural-Network-Deployment are comparing it to the libraries listed below
- This repository give a guidline to learn CUDA and TensorRT from the beginning.☆207Updated last month
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆127Updated 2 years ago
- A repo that uses TensorRT to deploy wll-trained models.Support RTDETR,YOLO-NAS,YOLOV5,YOLOV6,YOLOV7,YOLOV8,YOLOX.☆106Updated last year
- ☆255Updated 5 months ago
- 高性能计算课程&CUDA编程实例&深度学习推理框架☆43Updated last year
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。☆312Updated last week
- 该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。☆66Updated last year
- learning-cuda-trt☆111Updated 2 years ago
- 基于《cuda编程-基础与实践》(樊哲勇 著)的cuda学习之路。☆293Updated last year
- Deep Learning Deployment Framework: Supports tf/torch/trt/trtllm/vllm and other NN frameworks. Support dynamic batching, and streaming mo…☆158Updated this week
- ☆267Updated 2 years ago
- A CUDA tutorial to make people learn CUDA program from 0☆222Updated 8 months ago
- algorithm-cpp projects☆79Updated 2 years ago
- A light llama-like llm inference framework based on the triton kernel.☆100Updated 2 weeks ago
- Ai edge toolbox,专门面向边端设备尤其是嵌入式RTOS平台,AI模型部署工具链,包括模型推理引擎和模型压缩工具☆151Updated last year
- CUDA 算子手撕与面试指南☆276Updated 2 months ago
- trt-hackathon-2022 三等奖方案☆10Updated 2 years ago
- TensorRT 2022 亚军方案,tensorrt加速mobilevit模型☆62Updated 2 years ago
- 《CUDA编程基础与实践》一书的代码☆114Updated 2 years ago
- ☆132Updated last year
- 模型压缩的小白入门教程☆254Updated 4 months ago
- b站上的课程☆71Updated last year
- NCNN的代码学习,各种小Demo。☆103Updated last year
- Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+…☆123Updated this week
- This is a Chinese translation of the CUDA programming guide☆1,473Updated 4 months ago
- learning how CUDA works☆223Updated 3 weeks ago
- CUDA C 编程权威指南代码实现 包含了书上第二章到第八章的大部分代码实现和作者笔记,全由作者本人手动实现,难免有错误的地方,请大家谨慎参考,非常欢迎对错误的指正。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!☆324Updated 2 years ago
- ☆78Updated last year
- Base on tensorrt version 8.2.4, compare inference speed for different tensorrt api.☆41Updated 3 weeks ago
- 🚀 Do not need libtorch, pure C++ TensorRT deploys SOLOv2 etc, which can be quickly ported to NX/TX2.☆43Updated 2 years ago