SongQiPing / KuiperInfer_rs
使用 Rust 语言重新实现 https://github.com/zjhellofss/KuiperInfer 和 https://github.com/zjhellofss/kuiperdatawhale 中的深度学习推理框架。
☆12Updated 9 months ago
Alternatives and similar repositories for KuiperInfer_rs:
Users that are interested in KuiperInfer_rs are comparing it to the libraries listed below
- 自制基于C++的深度学习前向推理框架☆12Updated last year
- bilibili视频【CUDA 12.x 并行编程入门(C++版)】配套代码☆27Updated 5 months ago
- EasyNN是一个面向教学而开发的神经网络推理框架,旨在让大家0基础也能自主完成推理框架编写!☆24Updated 4 months ago
- ☆16Updated this week
- ☆15Updated 8 months ago
- A light llama-like llm inference framework based on the triton kernel.☆78Updated last week
- easy cuda code☆43Updated 3 weeks ago
- cuda编程学习入门☆31Updated 5 months ago
- CPU Memory Compiler and Parallel programing☆25Updated 2 months ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆41Updated last year
- 分层解耦的深度学习推理引擎☆67Updated last month
- ☆15Updated last year
- 这个项目介绍了简单的CUDA入门,涉及到CUDA执行模型、线程层次、CUDA内存模型、核函数的编写方式以及PyTorch使用CUDA扩展的两种方式。通过该项目可以基本入门基于PyTorch的CUDA扩展的开发方式。☆81Updated 3 years ago
- paper-read-notes☆10Updated 3 months ago
- 第一章 指针篇 第二章 CUDA原理篇 第三章 CUDA编译器环境配置篇 第四章 kernel函数基础篇 第五章 kernel索引(index)篇 第六章 kenel矩阵计算实战篇 第七章 kenel实战强化篇 第八章 CUDA内存应用与性能优化篇 第九章 CUDA原子(a…☆18Updated 5 months ago
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆23Updated last year
- A simple neural network inference framework☆25Updated last year
- A lite and head-only CGraph-API-liked DAG project.☆14Updated 2 months ago
- 大规模并行处理器编程实战 第二版答案☆29Updated 2 years ago
- 用C++实现一个简单的Transformer模型。 Attention Is All You Need。☆42Updated 3 years ago
- A unified and extensible pipeline for deep learning model inference with C++. Now support yolov8, yolov9, clip, and nanosam. More models …☆10Updated 8 months ago
- This is a repository to practice multi-thread programming in C++☆18Updated 10 months ago
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。☆264Updated last week
- Implementation and optimization of matrix multiplication on single CPU (HPC-THU-2023-Autumn)☆10Updated 10 months ago
- Inference deployment of the llama3☆11Updated 8 months ago
- 🐱 ncnn int8 模型量化评估☆12Updated 2 years ago
- 解读cudnn文档,掌握其用法☆16Updated 8 months ago
- YoloV10 NPU for the RK3566/68/88☆12Updated 7 months ago
- 《CUDA编程基础与实践》一书的代码☆105Updated 2 years ago