quic / ai-hub-apps
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices.
☆168 · Updated 2 weeks ago
Alternatives and similar repositories for ai-hub-apps:
Users interested in ai-hub-apps are comparing it to the libraries listed below.
- The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.)… ☆660 · Updated this week
- Demonstration of running a native LLM on an Android device. ☆127 · Updated this week
- ☆130 · Updated 3 weeks ago
- LLM inference in C/C++ ☆35 · Updated last week
- LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-… ☆330 · Updated this week
- Supporting PyTorch models with the Google AI Edge TFLite runtime. ☆512 · Updated this week
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK ☆62 · Updated this week
- Stable Diffusion inference on an Android phone's CPU. ☆150 · Updated last year
- ☆32 · Updated 3 weeks ago
- Demonstration of combining YOLO and depth estimation on an Android device. ☆43 · Updated this week
- A text-to-image project based on the open-source Stable Diffusion V1.5 model, generating models that run on a phone's CPU and NPU, along with a matching model-execution framework. ☆149 · Updated last year
- Fast Multimodal LLM on Mobile Devices ☆781 · Updated last week
- A toolkit to help optimize ONNX models. ☆129 · Updated this week
- ☆29 · Updated this week
- llama.cpp tutorial on an Android phone ☆97 · Updated 8 months ago
- Focuses on implementing a ggml-hexagon backend for Qualcomm's Hexagon NPU; details at https://github.com/zhouwg/ggml-hexagon… ☆13 · Updated this week
- ☆28 · Updated last year
- Run generative AI models with a simple C++/Python API using the OpenVINO Runtime. ☆249 · Updated this week
- DragGAN in NCNN with C++ ☆50 · Updated last year
- On-device Speech Recognition for Android ☆73 · Updated 3 weeks ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models ☆55 · Updated 6 months ago
- Generative AI extensions for onnxruntime ☆667 · Updated this week
- IRIS is an Android app for interfacing with GGUF / llama.cpp models locally. ☆192 · Updated 2 months ago
- A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, et… ☆832 · Updated 2 weeks ago
- Deploying a large language model (Qwen1.5-0.5B-Chat) on Android phones with MNN-llm. ☆72 · Updated 11 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆252 · Updated 5 months ago
- Stable Diffusion using MNN ☆65 · Updated last year
- llama.cpp fork with additional SOTA quants and improved performance ☆231 · Updated this week
- ☆84 · Updated 2 years ago
- ☆236 · Updated 4 months ago