b站上的课程
☆85Aug 25, 2023Updated 2 years ago
Alternatives and similar repositories for KuiperCourse
Users that are interested in KuiperCourse are comparing it to the libraries listed below
Sorting:
- 校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library st…☆3,354Jun 22, 2025Updated 9 months ago
- ☆314Oct 9, 2024Updated last year
- 模型压缩的小白入门教程☆22Jul 7, 2024Updated last year
- Flash Attention in ~100 lines of CUDA (forward pass only)☆10Jun 10, 2024Updated last year
- ☆11Mar 15, 2023Updated 3 years ago
- ☆38Oct 12, 2024Updated last year
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆14Jun 1, 2023Updated 2 years ago
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。☆512Oct 28, 2025Updated 4 months ago
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- An onnx-based quantitation tool.☆71Jan 8, 2024Updated 2 years ago
- https://github.com/shouxieai/hard_decode_trt windows编译版本☆13Sep 8, 2022Updated 3 years ago
- ☆15Jun 22, 2025Updated 9 months ago
- This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several…☆1,248Jul 29, 2023Updated 2 years ago
- 自制基于C++的深度学习前向推理框架☆21Jun 4, 2023Updated 2 years ago
- ☆10Sep 23, 2025Updated 5 months ago
- ☆27Jan 5, 2025Updated last year
- compiler learning resources collect.☆2,693Mar 19, 2025Updated last year
- ☆11Nov 13, 2022Updated 3 years ago
- 高性能 高精度 大陆车牌、港澳车牌、台湾车牌 韩国车牌(South Korea LPR)识别 代码开源(ncnn移植)☆40Nov 5, 2025Updated 4 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated last year
- EasyNN是一个面向教学而开发的神经网络推理框架,旨在让大家0基础也能自主完成推理框架编写!☆39Aug 26, 2024Updated last year
- 用C++和Python实现从头实现一个深度学习训练框架☆12Nov 22, 2020Updated 5 years ago
- how to optimize some algorithm in cuda.☆2,872Updated this week
- DLBlas: clean and efficient kernels☆35Updated this week
- 一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework☆1,762Mar 15, 2026Updated last week
- Optimize GEMM with tensorcore step by step☆37Dec 17, 2023Updated 2 years ago
- 使用VC检测车道线(曲线)☆10Apr 23, 2018Updated 7 years ago
- ☆15Apr 15, 2022Updated 3 years ago
- 关于自建AI推理引擎的手册,从0开始你需要知道的所有事情☆272Sep 8, 2022Updated 3 years ago
- Z项目系列-埋点SDK☆15Mar 11, 2026Updated last week
- KV cache compression via sparse coding☆17Oct 26, 2025Updated 4 months ago
- Deploy deep learning model on difference hardware and framework. (TensorRT/ONNX/MNN/RKNN)☆13Jan 2, 2022Updated 4 years ago
- DGEMM on KNL, achieve 75% MKL☆19May 19, 2022Updated 3 years ago
- Nano vLLM☆13Jun 26, 2025Updated 8 months ago
- AIInfra 和 AISystem开源课程项目☆41Jun 22, 2025Updated 9 months ago
- ☆69Mar 19, 2023Updated 3 years ago
- ☆11Dec 16, 2021Updated 4 years ago
- ☆128Mar 5, 2026Updated 2 weeks ago
- 非雇员OD管理复盘与面试改进思考☆16Jul 2, 2025Updated 8 months ago