jinmingyi1998 / opencl_kernels
An easy way to run, test, benchmark and tune OpenCL kernel files
☆23Updated last year
Alternatives and similar repositories for opencl_kernels:
Users that are interested in opencl_kernels are comparing it to the libraries listed below
- Tengine 管子是用来快速生产 demo 的辅助工具☆13Updated 3 years ago
- ☆24Updated 2 years ago
- MegEngine到其他框架的转换器☆69Updated last year
- OneFlow->ONNX☆42Updated last year
- TVMScript kernel for deformable attention☆24Updated 3 years ago
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.☆29Updated 2 years ago
- quantize aware training package for NCNN on pytorch☆70Updated 3 years ago
- Tencent NCNN with added CUDA support☆68Updated 4 years ago
- This is a demo how to write a high performance convolution run on apple silicon☆52Updated 3 years ago
- symmetric int8 gemm☆66Updated 4 years ago
- Common libraries for PPL projects☆29Updated 4 months ago
- Yet another Polyhedra Compiler for DeepLearning☆19Updated last year
- ☆21Updated last month
- Wanwu models release, code will be released soon☆24Updated 2 years ago
- 将MNN拆解的简易前向推理框架(for study!)☆20Updated 4 years ago
- the C++ version of Seq2Seq with ncnn☆23Updated 3 years ago
- cpp syntactic sugar☆9Updated 4 months ago
- python wrapper of ncnn with pybind11☆72Updated 4 years ago
- ☆18Updated 2 years ago
- ☆38Updated 2 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆37Updated 2 years ago
- ☆18Updated last year
- OneFlow Serving☆20Updated last month
- study of cutlass☆21Updated 3 months ago
- ☆11Updated last year
- Count number of parameters / MACs / FLOPS for ONNX models.☆90Updated 3 months ago
- This a bridge for converting torch,and other AI training framework to C++ speed up infer library,like NCNN and ect☆20Updated 5 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated 8 months ago
- ONNX Command-Line Toolbox☆35Updated 4 months ago
- A codebase & model zoo for pretrained backbone based on MegEngine.☆33Updated last year