QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime SDK APIs into a set of simplified interfaces for running models on the NPU/HTP.
☆150Apr 17, 2026Updated this week
Alternatives and similar repositories for ai-engine-direct-helper
Users that are interested in ai-engine-direct-helper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆396Apr 12, 2026Updated last week
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated 2 months ago
- Run Chinese MobileBert model on SNPE.☆15May 19, 2023Updated 2 years ago
- ☆187Jan 22, 2026Updated 2 months ago
- snpe tutorial☆10Dec 25, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) an…☆988Updated this week
- The repository supports TensorRT, QNN platform inference, 2D obstacle detection yolo series (yolov5, yolov8, yolo11, yolox), semantic seg…☆20May 6, 2025Updated 11 months ago
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated 2 years ago
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆86Apr 10, 2026Updated last week
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- ☆10Jul 18, 2024Updated last year
- Project is intended to build and deploy an scene detection application onto Qualcomm Robotics development Kit (RB5) that detects whether …☆10Jun 26, 2022Updated 3 years ago
- mobilenet-ssd snpe demo☆41Nov 19, 2021Updated 4 years ago
- Self-implemented NN operators for Qualcomm's Hexagon NPU☆61Sep 30, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆237Mar 29, 2024Updated 2 years ago
- A whisper repo for TPU☆11Jun 4, 2024Updated last year
- Fast Multimodal LLM on Mobile Devices☆1,470Apr 12, 2026Updated last week
- ☆10Oct 5, 2023Updated 2 years ago
- ☆15Nov 28, 2023Updated 2 years ago
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆49Apr 1, 2026Updated 2 weeks ago
- High-speed and easy-use LLM serving framework for local deployment☆148Aug 7, 2025Updated 8 months ago
- armchina NPU Integration☆24Oct 22, 2025Updated 5 months ago
- stable diffusion using mnn☆67Sep 28, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- tsingmicro AI model zoo☆10Aug 6, 2025Updated 8 months ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- A simple tutorial of SNPE.☆184Mar 30, 2023Updated 3 years ago
- DTMF Decoder and Encoder shield for Arduino☆11Nov 3, 2021Updated 4 years ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated 2 years ago
- A yolov7-tiny model inference applied on qualcomm snpe for pedestrian detection with embedded system.☆13Sep 23, 2024Updated last year
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆137Updated this week
- ☆32Apr 10, 2026Updated last week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆21Apr 1, 2026Updated 2 weeks ago
- This repo provides the C++ implementation of YOLO-NAS based on ONNXRuntime for performing object detection in real-time.Support float32/f…☆46Apr 1, 2024Updated 2 years ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 4 months ago
- Stable Diffusion+LCM在SG2300X上,纵享丝滑一秒出图☆17Nov 29, 2024Updated last year
- The simple implementation of UDP broadcasting and multicast☆29Apr 21, 2016Updated 9 years ago
- 8-bit CUDA functions for PyTorch, modified to build on Jetson Xavier☆15Apr 26, 2023Updated 2 years ago
- Sample projects for InferenceHelper, a Helper Class for Deep Learning Inference Frameworks: TensorFlow Lite, TensorRT, OpenCV, ncnn, MNN,…☆22Mar 27, 2022Updated 4 years ago