zpye / SimpleInfer
A simple neural network inference framework
☆25Updated last year
Alternatives and similar repositories for SimpleInfer:
Users that are interested in SimpleInfer are comparing it to the libraries listed below
- A one-page-only CGraph-API-liked DAG project.☆14Updated this week
- CUDA 6大并行计算模式 代码与笔记☆60Updated 4 years ago
- ☆10Updated 6 months ago
- This is a repository to practice multi-thread programming in C++☆19Updated 11 months ago
- 分层解耦的深度学习推理引擎☆70Updated last month
- ☆17Updated 9 months ago
- ☆24Updated 3 years ago
- ☆33Updated 3 months ago
- b站上的课程☆69Updated last year
- Common libraries for PPL projects☆29Updated 3 months ago
- High Performan Ai Model Web Server. Mainly support computer vision model. Quickly establish your own ai-model server. https://github.com/…☆39Updated 2 weeks ago
- 大规模并行处理器编程实战 第二版答案☆29Updated 2 years ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆13Updated last year
- ☆19Updated 3 years ago
- ☆94Updated 3 years ago
- pdf☆89Updated 6 years ago
- A light llama-like llm inference framework based on the triton kernel.☆78Updated 3 weeks ago
- ☆14Updated 3 years ago
- ☆11Updated 8 months ago
- 将MNN拆解的简易前向推理框架(for study!)☆20Updated 3 years ago
- My learning notes about AI, including Machine Learning and Deep Learning.☆18Updated 5 years ago
- 🐱 ncnn int8 模型量化评估☆12Updated 2 years ago
- autoTVM神经网络推理代码优化搜索演示,基于tvm编译开源模型centerface,并使用autoTVM搜索最优推理代码, 最终部署编译为c++代码,演示平台是cuda,可以是其他平台,例如树莓派,安卓手机,苹果手机.Thi is a demonstration of …☆27Updated 3 years ago
- study of cutlass☆20Updated 2 months ago
- 解读cudnn文档,掌握其用法☆16Updated 9 months ago
- arm-neon☆89Updated 5 months ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆44Updated 2 months ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆50Updated 7 months ago
- OneFlow->ONNX☆42Updated last year
- C++数 据流并行处理框架☆24Updated 3 years ago