quic / ai-engine-direct-helper

QAI AppBuilder is designed to help developers easily execute models on Windows on Snapdragon (WoS) and Linux platforms. It encapsulates the Qualcomm® AI Runtime SDK APIs into a set of simplified interfaces for running models on the NPU/HTP.

☆59

Alternatives and similar repositories for ai-engine-direct-helper

Users who are interested in ai-engine-direct-helper are comparing it to the libraries listed below.

MollySophia / rwkv-qualcomm
Run RWKV v5, v6 and v7 inference with the Qualcomm AI Engine Direct SDK
☆77 Updated 3 weeks ago
quic / qidk
☆149 Updated last month
hova88 / CUDA-MatMul-Practice
☆17 Updated last year
jeffzhou2000 / ggml-hexagon
the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…
☆27 Updated 3 weeks ago
tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆157 Updated last year
VeriSilicon / TIM-VX
VeriSilicon Tensor Interface Module
☆236 Updated 7 months ago
MegEngine / mperf
mperf is an operator performance tuning toolbox for mobile/embedded platforms
☆188 Updated last year
MegEngine / mgeconvert
Converter from MegEngine to other frameworks
☆70 Updated 2 years ago
XiaoMi / nnlib
Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib
☆58 Updated 2 years ago
DeepLink-org / CVFusion
CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.
☆31 Updated 2 years ago
atanmarko / ncnn-with-cuda
Tencent NCNN with added CUDA support
☆69 Updated 4 years ago
inisis / OnnxLLM
Large Language Model Onnx Inference Framework
☆36 Updated 6 months ago
BBuf / ArmNeonOptimization
arm-neon
☆92 Updated last year
daquexian / web-model-converter
☆41 Updated 2 years ago
gesanqiu / Chinese_MobileBert_on_SNPE
Run Chinese MobileBert model on SNPE.
☆15 Updated 2 years ago
BBuf / how-to-optimize-gemm
☆97 Updated 4 years ago
OpenPPL / ppl.common
Common libraries for PPL projects
☆29 Updated 4 months ago
gesanqiu / SNPE_Tutorial
A simple tutorial of SNPE.
☆177 Updated 2 years ago
OpenPPL / ppl.kernel.cuda
☆37 Updated 9 months ago
ModelTC / Dipoorlet
Offline Quantization Tools for Deploy.
☆132 Updated last year
Cambricon / mlu-ops
Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU).
☆125 Updated last week
lrw04 / llama2.c-to-ncnn
A converter for llama2.c legacy models to ncnn models.
☆81 Updated last year
chraac / llama.cpp
LLM inference in C/C++
☆44 Updated this week
tpoisonooo / chgemm
symmetric int8 gemm
☆66 Updated 5 years ago
airockchip / rknpu_ddk
DDK for Rockchip NPU
☆65 Updated 4 years ago
wangzhaode / mnn-stable-diffusion
stable diffusion using mnn
☆66 Updated last year
waau / qualcomm-nnlib
Qualcomm Hexagon NN Offload Framework
☆43 Updated 4 years ago
wangzhaode / onnx-llm
LLM deployment project based on ONNX.
☆42 Updated 9 months ago
MegEngine / examples
A set of examples around MegEngine
☆31 Updated last year
inisis / OnnxSlim
A Toolkit to Help Optimize Onnx Model
☆188 Updated last week