tangjunjun966 / cuda-tutorial-master
第一章 指针篇 第二章 CUDA原理篇 第三章 CUDA编译器环境配置篇 第四章 kernel函数基础篇 第五章 kernel索引(index)篇 第六章 kenel矩阵计算实战篇 第七章 kenel实战强化篇 第八章 CUDA内存应用与性能优化篇 第九章 CUDA原子(atomic)实战篇 第十章 CUDA流(stream)实战篇 第十一章 CUDA的NMS算子实战篇 第十二章 YOLO的部署实战篇 第十三章 基于CUDA的YOLO部署实战篇
☆16Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for cuda-tutorial-master
- 高效部署:YOLO X, V3, V4, V5, V6, V7, V8, EdgeYOLO TRT推理 ™️ ,前后处理均由CUDA核函数实现 CPP/CUDA🚀☆48Updated last year
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆13Updated last year
- ☆78Updated last year
- 对 tensorRT_Pro 开源项目理解☆18Updated last year
- 彻底弄懂BP反向传播,15行代码,C++实现也简单,MNIST分类98.29%精度☆34Updated 2 years ago
- ☆24Updated last week
- 跟着Tensorrt_pro学习各种知识☆37Updated 2 years ago
- A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt☆108Updated 2 years ago
- Llama3 Streaming Chat Sample☆23Updated 7 months ago
- https://zhuanlan.zhihu.com/p/396448133☆40Updated 2 years ago
- 深度学习, especially CV☆39Updated this week
- "FastSAM_Awsome_Openvino" 项目展示了如何通过 OpenVINO 框架高效部署 FastSAM 模型,实现了令人瞩目的实例分割功能。该项目提供了 C++ 版本和 Python 版本两种实现,为开发者提供了在不同语言环境下使用 FastSAM 模型的选…☆31Updated 11 months ago
- ☆42Updated last year
- 用OpenVINO对yolov8导出的onnx模型进行C++的推理, 任务包括图像分类, 目标识别和语义分割, 步骤包括图片前处理, 推理, NMS等☆52Updated 6 months ago
- 简单快速的部署深度学习模型☆10Updated last year
- TensorRT 2022 亚军方案,tensorrt加速mobilevit模型☆59Updated 2 years ago
- yolov5 tensorrt int8量化方法汇总☆59Updated 11 months ago
- 🚀🚀🚀This is an AI high-performance reasoning C++ library, Currently supports the deployment of yolov5, yolov7, yolov7-pose, yolov8, yol…☆120Updated 6 months ago
- Speed up image preprocess with cuda when handle image or tensorrt inference☆54Updated last week
- 基于OpenMP和CUDA的并行计算项目 - 马赛克生成器☆7Updated 5 years ago
- An onnx-based quantitation tool.☆71Updated 10 months ago
- Easy Training Official YOLOv8、YOLOv7、YOLOv6、YOLOv5 and Prune all_model using Torch-Pruning!☆50Updated 10 months ago
- A unified and extensible pipeline for deep learning model inference with C++. Now support yolov8, yolov9, clip, and nanosam. More models …☆10Updated 6 months ago
- learning-cuda-trt☆103Updated last year
- This is a repository to practice multi-thread programming in C++☆17Updated 9 months ago
- 分别使用OpenCV,ONNXRuntime部署yolov5不规则四边形目标检测,包含C++和Python两个版本的程序☆27Updated 2 years ago
- ☆110Updated 8 months ago
- YOLOv8 Inference C++ sample code based on OpenVINO C++ API☆39Updated last year
- yolov8n 部署版,基于官方的导出onnx脚本导出onnx模型,在不同平台上进行部署测试,便于移植不同平台(onnx、tensorRT、rknn、Horizon)。☆36Updated last year
- cpp project template based on visual studio, OpenCV and CUDA, gdb debug, makefile☆27Updated 3 years ago