jinmingyi1998 / opencl_kernelsLinks

An easy way to run, test, benchmark and tune OpenCL kernel files

☆23

Alternatives and similar repositories for opencl_kernels

Users that are interested in opencl_kernels are comparing it to the libraries listed below

Sorting:

MegEngine / mgeconvert
MegEngine到其他框架的转换器
☆70Updated 2 years ago
atanmarko / ncnn-with-cuda
Tencent NCNN with added CUDA support
☆69Updated 4 years ago
OAID / TengineInferPipe
☆24Updated 2 years ago
daquexian / web-model-converter
☆41Updated 2 years ago
DeepLink-org / CVFusion
CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.
☆31Updated 2 years ago
tpoisonooo / tengine-pipe
Tengine 管子是用来快速生产 demo 的辅助工具
☆13Updated 4 years ago
Oneflow-Inc / oneflow-lite
☆18Updated last year
OpenPPL / ppl.common
Common libraries for PPL projects
☆29Updated 4 months ago
Oneflow-Inc / oneflow_convert
OneFlow->ONNX
☆43Updated 2 years ago
inisis / OnnxLLM
Large Language Model Onnx Inference Framework
☆36Updated 6 months ago
lrw04 / llama2.c-to-ncnn
A converter for llama2.c legacy models to ncnn models.
☆81Updated last year
Adlik / model_zoo
☆11Updated this week
FeiGeChuanShu / trt2023
NVIDIA TensorRT Hackathon 2023复赛选题：通义千问Qwen-7B用TensorRT-LLM模型搭建及优化
☆42Updated last year
pytorch-labs / tokenizers
C++ implementations for various tokenizers (sentencepiece, tiktoken etc).
☆34Updated this week
scarsty / cccc-lite
☆45Updated 8 months ago
caishanli / pyncnn
python wrapper of ncnn with pybind11
☆72Updated 4 years ago
ZHEQIUSHUI / CLIP-ONNX-AX650-CPP
c++实现的clip推理，模型有一点点改动，但是不大，改动和导出模型的代码可以在readme里找到，模型文件都在Releases里，包括AX650的模型。新增支持ChineseCLIP
☆30Updated last month
ChenShisen / ncnnqat
quantize aware training package for NCNN on pytorch
☆69Updated 4 years ago
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆93Updated 9 months ago
tpoisonooo / cpp-syntactic-sugar
cpp syntactic sugar
☆8Updated last month
tpoisonooo / chgemm
symmetric int8 gemm
☆66Updated 5 years ago
pigirons / conv3x3_m1
This is a demo how to write a high performance convolution run on apple silicon
☆54Updated 3 years ago
DayBreak-u / seq2seq_ncnn
the C++ version of Seq2Seq with ncnn
☆23Updated 4 years ago
hisrg / SNPE
Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…
☆34Updated 3 years ago
FeiGeChuanShu / ncnn-android-yolov6
☆65Updated 3 years ago
yester31 / Cutlass_EX
study of cutlass
☆22Updated 8 months ago
torchpipe / torchpipe
Serving Inside Pytorch
☆163Updated this week
lucasjinreal / wanwu_release
Wanwu models release, code will be released soon
☆24Updated 2 years ago
deepglint / eq-ncnn
☆42Updated 5 years ago
azeme1 / keras2ncnn
☆24Updated 3 years ago