zhouwg / ggml-hexagon
An open-source reference implementation of the ggml-hexagon backend for llama.cpp on Android phones equipped with Qualcomm's Hexagon NPU; details at https://github.com/zhouwg/ggml-hexagon/discussions/18
☆22 · Updated this week
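For context, a ggml backend such as ggml-hexagon is driven through ggml's generic backend interface rather than a bespoke API. The sketch below shows that flow in plain C. The entry point ggml_backend_hexagon_init() is an assumption, named by analogy with ggml_backend_cpu_init(); check the repository's headers for the actual initialization call. Everything else uses the standard ggml/ggml-backend API.

```c
// Minimal sketch of ggml's generic backend flow. The hexagon init
// function is HYPOTHETICAL (commented out); the CPU backend stands in
// so the example runs as-is against stock ggml.
#include "ggml.h"
#include "ggml-alloc.h"
#include "ggml-backend.h"

int main(void) {
    // ggml_backend_t backend = ggml_backend_hexagon_init(); // hypothetical
    ggml_backend_t backend = ggml_backend_cpu_init();

    // Build a tiny graph c = a * b; no_alloc defers tensor data to the backend.
    struct ggml_init_params params = {
        .mem_size   = 1 << 20,  // metadata only, so 1 MiB is plenty
        .mem_buffer = NULL,
        .no_alloc   = true,
    };
    struct ggml_context * ctx = ggml_init(params);
    struct ggml_tensor * a = ggml_new_tensor_1d(ctx, GGML_TYPE_F32, 4);
    struct ggml_tensor * b = ggml_new_tensor_1d(ctx, GGML_TYPE_F32, 4);
    struct ggml_tensor * c = ggml_mul(ctx, a, b);

    // Place all tensors in a buffer owned by the chosen backend.
    ggml_backend_buffer_t buf = ggml_backend_alloc_ctx_tensors(ctx, backend);

    const float av[4] = {1, 2, 3, 4};
    const float bv[4] = {5, 6, 7, 8};
    ggml_backend_tensor_set(a, av, 0, sizeof(av));
    ggml_backend_tensor_set(b, bv, 0, sizeof(bv));

    // Run the graph on the backend and read the result back.
    struct ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, c);
    ggml_backend_graph_compute(backend, gf);

    float cv[4];
    ggml_backend_tensor_get(c, cv, 0, sizeof(cv));  // {5, 12, 21, 32}

    ggml_backend_buffer_free(buf);
    ggml_free(ctx);
    ggml_backend_free(backend);
    return 0;
}
```

Swapping the CPU init call for the backend-specific one is, in principle, the only change needed to move such a graph onto the NPU; that device-agnostic dispatch is what a ggml backend implementation provides.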
Alternatives and similar repositories for ggml-hexagon
Users interested in ggml-hexagon are comparing it to the libraries listed below.
- LLM inference in C/C++ ☆41 · Updated last week
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK ☆68 · Updated last week
- LLM deployment project based on ONNX. ☆37 · Updated 7 months ago
- Llama 2 inference ☆41 · Updated last year
- ☆123 · Updated last year
- Sophgo AI chip driver and runtime library. ☆21 · Updated last week
- ☆10 · Updated 10 months ago
- mperf is an operator performance tuning toolbox for mobile/embedded platforms ☆187 · Updated last year
- A converter for llama2.c legacy models to ncnn models. ☆87 · Updated last year
- Large Language Model ONNX Inference Framework ☆35 · Updated 4 months ago
- EasyNN is a neural network inference framework developed for teaching, designed so that anyone can write an inference framework on their own, even with zero prior experience! ☆28 · Updated 9 months ago
- TensorRT encapsulation: learn, rewrite, practice. ☆28 · Updated 2 years ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai ☆43 · Updated this week
- Snapdragon Neural Processing Engine (SNPE) SDK: a Qualcomm Snapdragon software accelerate… ☆35 · Updated 3 years ago
- Stable Diffusion using MNN ☆68 · Updated last year
- ☆32 · Updated 10 months ago
- ☆34 · Updated 2 months ago
- QAI AppBuilder is designed for developers to use the Qualcomm® AI Runtime SDK to execute models on Windows on Snapdragon (WoS) and Linux platf… ☆41 · Updated this week
- A llama model inference framework implemented in CUDA C++ ☆57 · Updated 6 months ago
- ☆21 · Updated 4 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency ☆109 · Updated 8 months ago
- ggml study notes; ggml is an inference framework for machine learning ☆15 · Updated last year
- ☆33 · Updated last year
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices" ☆53 · Updated 4 months ago
- 📚 FFPA (Split-D): extend FlashAttention with Split-D for large headdim; O(1) GPU SRAM complexity; 1.8x~3x↑🎉 faster than SDPA EA. ☆184 · Updated 3 weeks ago
- ☆148 · Updated 4 months ago
- Inference deployment of Llama 3 ☆11 · Updated last year
- ☆36 · Updated 7 months ago
- Header-only safetensors loader and saver in C++ ☆62 · Updated 3 weeks ago
- A simple neural network inference framework ☆25 · Updated last year