zjhellofss / KuiperInfer

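A great project for campus recruiting (autumn and spring hiring seasons) and internships! Implement a high-performance deep learning inference library from scratch, step by step, with support for inference of models such as the large model llama2, Unet, Yolov5, and Resnet.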

☆3,064

Alternatives and similar repositories for KuiperInfer

Users who are interested in KuiperInfer are comparing it to the libraries listed below.

nndeploy / nndeploy
Workflow-based Multi-platform AI Deployment Tool
☆1,138 Updated this week
zjhellofss / kuiperdatawhale
☆290 Updated 10 months ago
l0ngc / hpc-learning
hpc-learning
☆752 Updated last year
Eddie-Wang1120 / HPC-Learning-Notes
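Study notes on high-performance computing, including the notes themselves and code demos for the related topics; continuously being improved. If you find it helpful, please give it a Star; it helps the author a lot. Thank you!
☆442 Updated 2 years ago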
zjhellofss / KuiperLLama
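A great project for campus recruiting (autumn and spring hiring seasons) and internships: build, hands-on and from scratch, a large-model inference framework that supports LLama2/3 and Qwen2.5.
☆410 Updated last month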
BBuf / how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
☆2,421 Updated this week
openmlsys / openmlsys-zh
Machine Learning Systems: Design and Implementation - Chinese Version
☆4,511 Updated last year
QINZHAOYU / CudaSteps
A CUDA learning path based on the book 《CUDA编程：基础与实践》 (CUDA Programming: Basics and Practice) by Fan Zheyong (樊哲勇).
☆346 Updated last year
Tongkaio / CUDA_Kernel_Samples
Hand-written CUDA operators and an interview guide
☆541 Updated 7 months ago
parallel101 / course
High-Performance Parallel Programming and Optimization - Course Materials
☆4,063 Updated 10 months ago
BBuf / tvm_mlir_learn
A collection of compiler learning resources.
☆2,491 Updated 5 months ago
HeKun-NVIDIA / CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
☆1,651 Updated 9 months ago
Tony-Tan / CUDA_Freshman
☆2,529 Updated last year
ChunelFeng / CGraph
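[A commonly used C++ & Python DAG framework] A general-purpose, cross-platform parallel computing framework based on flow graphs, with no third-party dependencies, listed in awesome-cpp. Stars, forks, and discussion are welcome.
☆2,090 Updated last week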
PaddleJitLab / CUDATutorial
A self-learning tutorial for CUDA high-performance programming.
☆712 Updated last month
Eddie-Wang1120 / Professional-CUDA-C-Programming-Code-and-Notes
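Code implementations for the book Professional CUDA C Programming, covering most of the code from chapters 2 through 8 along with the author's notes. Everything was implemented by hand by the author, so some errors are inevitable; please use it with caution, and corrections are very welcome. If you find it helpful, please give it a Star; it helps the author a lot. Thank you!
☆354 Updated 2 years ago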
xlite-dev / LeetCUDA
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
☆6,127 Updated last week
caixiongjiang / HPC
High-performance computing course, CUDA programming examples, and a deep learning inference framework
☆53 Updated last year
harleyszhang / dl_note
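Deep learning systems notes, covering the mathematical foundations of deep learning, detailed explanations of basic neural network components, deep learning training and tuning strategies, and detailed explanations of model compression algorithms.
☆490 Updated 2 months ago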
harleyszhang / llm_note
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.
☆811 Updated last week
yuesong-feng / 30dayMakeCppServer
Build your own C++ server in 30 days; includes a tutorial and source code
☆6,752 Updated 4 months ago
brucefan1983 / CUDA-Programming
Sample codes for my CUDA programming book
☆1,793 Updated 6 months ago
RussWong / CUDATutorial
A CUDA tutorial for learning CUDA programming from scratch
☆248 Updated last year
godweiyang / NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
☆1,498 Updated 4 years ago
BBuf / how-to-learn-deep-learning-framework
how to learn PyTorch and OneFlow
☆449 Updated last year
Infrasys-AI / AIInfra
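AIInfra (AI infrastructure) refers to the AI system stack, from underlying hardware such as chips up to the upper-level software stack, that supports training and inference of large AI models.
☆3,930 Updated this week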
parallel101 / cppguidebook
A Chinese-language encyclopedia of modern C++, written under the lead of Teacher Peng (小彭老师)
☆917 Updated last week
Infrasys-AI / aisystem-docs
☆318 Updated last month
Infrasys-AI / AISystem
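AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.
☆14,861 Updated 2 weeks ago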
MAhaitao999 / CUDA_Programming
Code for the book 《CUDA编程基础与实践》 (CUDA Programming: Basics and Practice)
☆133 Updated 3 years ago