QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime SDK APIs into a set of simplified interfaces for running models on the NPU/HTP.
☆138Mar 6, 2026Updated this week
Alternatives and similar repositories for ai-engine-direct-helper
Users that are interested in ai-engine-direct-helper are comparing it to the libraries listed below
Sorting:
- Run Chinese MobileBert model on SNPE.☆15May 19, 2023Updated 2 years ago
- snpe tutorial☆10Dec 25, 2023Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 14, 2026Updated 3 weeks ago
- ☆182Jan 22, 2026Updated last month
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆67Sep 22, 2024Updated last year
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- ☆342Feb 12, 2026Updated 3 weeks ago
- mobilenet-ssd snpe demo☆41Nov 19, 2021Updated 4 years ago
- ☆10Jul 18, 2024Updated last year
- Project is intended to build and deploy an scene detection application onto Qualcomm Robotics development Kit (RB5) that detects whether …☆10Jun 26, 2022Updated 3 years ago
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated last year
- Dockerfiles for poetry/mlc-llm(rk3588)/...☆10Sep 13, 2023Updated 2 years ago
- ☆14Nov 28, 2023Updated 2 years ago
- A yolov7-tiny model inference applied on qualcomm snpe for pedestrian detection with embedded system.☆13Sep 23, 2024Updated last year
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated last year
- Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) an…☆936Feb 25, 2026Updated last week
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆86Mar 2, 2026Updated last week
- 搜藏的希望的代码片段☆13Jun 6, 2023Updated 2 years ago
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆235Mar 29, 2024Updated last year
- Self-implemented NN operators for Qualcomm's Hexagon NPU☆49Sep 30, 2025Updated 5 months ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 3 months ago
- High-speed and easy-use LLM serving framework for local deployment☆147Aug 7, 2025Updated 7 months ago
- stable diffusion using mnn☆67Sep 28, 2023Updated 2 years ago
- This repo provides the C++ implementation of YOLO-NAS based on ONNXRuntime for performing object detection in real-time.Support float32/f…☆46Apr 1, 2024Updated last year
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- Fast Multimodal LLM on Mobile Devices☆1,422Updated this week
- A simple tutorial of SNPE.☆183Mar 30, 2023Updated 2 years ago
- ☆22Apr 10, 2024Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- 本仓库基于 Intel OpenVINO Toolkit 部署 LightTrack 跟踪算法,包含 Python、C++ 两种语言的推理代码.☆21Nov 2, 2023Updated 2 years ago
- High Performan Ai Model Web Server. Mainly support computer vision model. Quickly establish your own ai-model server. https://github.com/…☆45May 13, 2025Updated 9 months ago
- armchina NPU Integration☆24Oct 22, 2025Updated 4 months ago
- Sample projects for InferenceHelper, a Helper Class for Deep Learning Inference Frameworks: TensorFlow Lite, TensorRT, OpenCV, ncnn, MNN,…☆22Mar 27, 2022Updated 3 years ago
- YoloV5 NPU multithread for the RK3566/68/88 (200 FPS)☆25Dec 24, 2024Updated last year
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆90Apr 8, 2024Updated last year
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆38Jul 14, 2025Updated 7 months ago
- RKNN模型推理部署模板☆24Aug 11, 2023Updated 2 years ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆62Apr 7, 2024Updated last year
- 对 tensorRT_Pro 开源项目理解☆22Feb 23, 2023Updated 3 years ago