zpye / SimpleInfer
A simple neural network inference framework
☆25Updated last year
Alternatives and similar repositories for SimpleInfer:
Users that are interested in SimpleInfer are comparing it to the libraries listed below
- A one-page-only CGraph-API-liked DAG project.☆16Updated this week
- ☆10Updated 6 months ago
- CUDA 6大并行计算模式 代码与笔记☆60Updated 4 years ago
- 分层解耦的深度学习推理引擎☆70Updated 2 months ago
- ☆24Updated 3 years ago
- This is a repository to practice multi-thread programming in C++☆19Updated 11 months ago
- 大规模并行处理器编程实战 第二版答案☆30Updated 2 years ago
- 🐱 ncnn int8 模型量化评估☆12Updated 2 years ago
- ☆35Updated 4 months ago
- High Performan Ai Model Web Server. Mainly support computer vision model. Quickly establish your own ai-model server. https://github.com/…☆39Updated last month
- 使用 CUDA C++ 实现的 llama 模型推理框架☆44Updated 3 months ago
- ☆17Updated 10 months ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆51Updated 8 months ago
- EasyNN是一个面向教学而开发的神经网络推理框架,旨在让大家0基础也能自主完成推理框架编写!☆25Updated 5 months ago
- Common libraries for PPL projects☆29Updated 3 months ago
- CPU Memory Compiler and Parallel programing☆25Updated 2 months ago
- ☆14Updated 3 years ago
- ☆19Updated 3 years ago
- study of cutlass☆21Updated 3 months ago
- arm-neon☆89Updated 6 months ago
- llama 2 Inference☆41Updated last year
- 解读cudnn文档,掌握其用法☆16Updated 9 months ago
- b站上的课程☆71Updated last year
- ☆95Updated 3 years ago
- ☆11Updated 9 months ago
- ☆26Updated 10 months ago
- A light llama-like llm inference framework based on the triton kernel.☆83Updated this week
- ☆26Updated 8 months ago
- ☆15Updated last year
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆28Updated last year