chraac/llama-cpp-qnn-builder

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chraac/llama-cpp-qnn-builder)

chraac / llama-cpp-qnn-builder

☆30

Alternatives and similar repositories for llama-cpp-qnn-builder

Users that are interested in llama-cpp-qnn-builder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chraac / llama.cpp
View on GitHub
LLM inference in C/C++
☆53Jul 10, 2026Updated last week
zhouwg / ggml-hexagon
View on GitHub
the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, history of ggml-hexagon…
☆48Updated this week
tanjatang / snpe_resnet
View on GitHub
snpe tutorial
☆10Dec 25, 2023Updated 2 years ago
KoKoLates / snpe-yolov7-inference
View on GitHub
Inference of YOLOv7 model applied on Qualcomm SNPE for pedestrian detection with embedded system.
☆13Sep 23, 2024Updated last year
latentCall145 / channels-last-groupnorm
View on GitHub
A CUDA kernel for NHWC GroupNorm for PyTorch
☆23Nov 15, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
nilseuropa / hopenet_ncnn
View on GitHub
Hopenet: deep head pose estimator on ncnn
☆10Jun 18, 2020Updated 6 years ago
oujieww / ANPD
View on GitHub
☆11Feb 5, 2026Updated 5 months ago
Qengineering / Head-Pose-ncnn-Raspberry-Pi-4
View on GitHub
Ultra fast head pose estimation on a bare Raspberry Pi 4 at 20 FPS
☆10Dec 21, 2021Updated 4 years ago
powerserve-project / PowerServe
View on GitHub
High-speed and easy-use LLM serving framework for local deployment
☆161Aug 7, 2025Updated 11 months ago
ajokela / rktop
View on GitHub
High-performance system monitor for Rockchip SoCs (RK3588, RK3399) with real-time CPU, GPU, NPU, RGA, memory, and process monitoring. W…
☆25Nov 25, 2025Updated 7 months ago
defi0x1 / pose-detect-optimizer
View on GitHub
Optimized pose detector inference for edge devices
☆15Feb 23, 2023Updated 3 years ago
qualcomm / fastrpc
View on GitHub
FastRPC is Qualcomm's userspace library that facilitates efficient remote procedure calls between the CPU and DSP for high-performance co…
☆104Jul 14, 2026Updated last week
qualcomm / qai-appbuilder
View on GitHub
QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …
☆187Updated this week
taowen / hexagon-tutorial
View on GitHub
hexagon tutorial
☆56Mar 29, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ToyoDAdoubiBackup / SSRStatus
View on GitHub
Shadowsocks/ShadowsocksR 账号在线监控
☆12Nov 25, 2018Updated 7 years ago
Machine-Learning-Tokyo / edgeai-lab-microcontroller-series
View on GitHub
This repository is to share the EdgeAI Lab with Microcontrollers Series material to the entire community. We will share documents, presen…
☆17Oct 14, 2021Updated 4 years ago
haozixu / htp-ops-lib
View on GitHub
Self-implemented NN operators for Qualcomm's Hexagon NPU
☆76Sep 30, 2025Updated 9 months ago
wkt / YoloMobile
View on GitHub
A Android Library for YOLOv5/YOLOv7/YOLOv8 Detection and Pose Inference Based on NCNN
☆59Aug 15, 2024Updated last year
ppogg / Deepstream-Box
View on GitHub
deepstream + cuda，yolo26，yolo-master，yolo11，yolov8，sam，transformer, etc.
☆27Feb 7, 2026Updated 5 months ago
peTzxz / Mini-Program_Online-tutoring
View on GitHub
智能家教微信小程序
☆11Sep 15, 2018Updated 7 years ago
chuoibo / VocalMind
View on GitHub
End to End Speech to Speech with Emotion System
☆15Feb 6, 2025Updated last year
zdenop / tessdata_downloader
View on GitHub
Tesseract tessdata downloader from GitHub repositories
☆11Sep 17, 2021Updated 4 years ago
hpc203 / Gaze-LLE-onnxrun
View on GitHub
使用onnxruntime部署Gaze-LLE凝视目标估计，包含C++和Python两个版本的程序
☆17Jan 21, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
geyang / variational_autoencoder_pytorch
View on GitHub
pyTorch variational autoencoder, with explainations
☆11May 31, 2017Updated 9 years ago
chenyueqi / hotBPF
View on GitHub
☆15Apr 28, 2023Updated 3 years ago
ArtificialZeng / transformers-Explained
View on GitHub
官方transformers源码解析。AI大模型时代，pytorch、transformer是新操作系统，其他都是运行在其上面的软件。
☆16Sep 25, 2023Updated 2 years ago
YangLinzhuo / cuda-sgemm-optimization
View on GitHub
CUDA SGEMM optimization note
☆15Oct 31, 2023Updated 2 years ago
shengbinmeng / Bjontegaard_metric
View on GitHub
Bjontegaard metric calculation. Include BD-PSNR and BD-rate
☆14Sep 4, 2024Updated last year
daquexian / dnnlibrary-example
View on GitHub
An example app of DNNLibrary :)
☆13Jul 26, 2019Updated 6 years ago
mi150 / VaLoRA
View on GitHub
☆11May 19, 2025Updated last year
Kazuhito00 / Pycon-mini-Tokai-2024-VLM-Colaboratory-Sample
View on GitHub
PyCon mini 東海 2024 のトーク「Google Colaboratoryで試すVLM」で紹介したサンプル集
☆12Nov 15, 2024Updated last year
FrancescoSaverioZuppichini / detector
View on GitHub
☆13Apr 28, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
xingrz / rpi-pico-builder
View on GitHub
Build environment for Raspberry Pi Pico (RP2040) C/C++ SDK
☆11Jan 25, 2021Updated 5 years ago
shirohana / alfred3-youtube-control
View on GitHub
🎵 Control YouTube players with browser by Alfred
☆12Sep 30, 2020Updated 5 years ago
Luowaterbi / TokenRecycling
View on GitHub
[ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling
☆29Nov 11, 2025Updated 8 months ago
pittisl / ElasticTrainer
View on GitHub
Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)
☆14Nov 1, 2023Updated 2 years ago
XiaoMi / StableDiffusionOnDevice
View on GitHub
本项目是一个通过文字生成图片的项目，基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型，包括其配套的模型运行框架。
☆245Mar 29, 2024Updated 2 years ago
jiachengh / Fleet
View on GitHub
☆13Mar 18, 2024Updated 2 years ago
ruankie / rag-qa
View on GitHub
RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way
☆21Jan 25, 2024Updated 2 years ago