qualcomm/ai-hub-models

qualcomm / ai-hub-models

Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

☆1,045

Alternatives and similar repositories for ai-hub-models

Users who are interested in ai-hub-models are comparing it to the libraries listed below.

qualcomm / ai-hub-apps
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…
☆409 · Updated this week
qualcomm / qidk
☆191 · Apr 24, 2026 · Updated 3 weeks ago
quic / aimet-model-zoo
☆343 · Feb 12, 2026 · Updated 3 months ago
MollySophia / rwkv-qualcomm
Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK
☆91 · Updated this week
quic / aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
☆2,617 · Updated this week
pytorch / executorch
On-device AI across mobile, embedded and edge for PyTorch
☆4,622 · Updated this week
haozixu / htp-ops-lib
Self-implemented NN operators for Qualcomm's Hexagon NPU
☆68 · Sep 30, 2025 · Updated 7 months ago
globaledgesoft / Unsupported-Operation-Development-in-SNPE
This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…
☆10 · Oct 4, 2021 · Updated 4 years ago
quic / ai-engine-direct-helper
QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …
☆158 · Updated this week
google-ai-edge / litert-torch
Support PyTorch model conversion with LiteRT.
☆1,022 · Updated this week
quic / efficient-transformers
This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…
☆89 · Updated this week
UbiquitousLearning / mllm
Fast Multimodal LLM on Mobile Devices
☆1,508 · Apr 30, 2026 · Updated 3 weeks ago
saic-fi / MobileQuant
[EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models
☆68 · Sep 22, 2024 · Updated last year
XiaoMi / StableDiffusionOnDevice
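This project generates images from text. Based on the open-source Stable Diffusion V1.5 model, it produces models that can run on a phone's CPU and NPU, along with the accompanying model runtime framework.
☆239 · Mar 29, 2024 · Updated 2 years ago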
argmaxinc / WhisperKitAndroid
On-device Speech Recognition for Android
☆208 · Jan 24, 2026 · Updated 3 months ago
kantv-ai / kantv
workbench for learning and practicing on-device AI technology in real scenario with online-TV on Android phone, powered by ggml(llama.cpp…
☆192 · Jun 12, 2025 · Updated 11 months ago
Achazwl / mlc
MiniCPM on Android platform.
☆640 · Mar 19, 2025 · Updated last year
gesanqiu / SNPE_Tutorial
A simple tutorial of SNPE.
☆185 · Mar 30, 2023 · Updated 3 years ago
facebookresearch / MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
☆1,438 · Apr 30, 2026 · Updated 3 weeks ago
microsoft / onnxruntime-genai
Generative AI extensions for onnxruntime
☆1,029 · Updated this week
google-ai-edge / LiteRT
LiteRT, successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms, via e…
☆2,392 · May 14, 2026 · Updated last week
airockchip / rknn3-toolkit
☆56 · Apr 25, 2026 · Updated 3 weeks ago
MediaTek-NeuroPilot / tflite-neuron-delegate
MediaTek's TFLite delegate
☆53 · Dec 8, 2025 · Updated 5 months ago
PINTO0309 / onnx2tf
A tool for converting ONNX files to LiteRT/TFLite/TensorFlow, PyTorch native code (nn.Module), TorchScript (.pt), state_dict (.pt), Expor…
☆957 · Apr 1, 2026 · Updated last month
Plumess / yolov5-qnn
A project for QNN quantization of YOLOv5 in the Qualcomm AI Engine Direct environment, with CPU inference.
☆17 · Sep 10, 2024 · Updated last year
DakeQQ / Native-LLM-for-Android
Demonstration of running a native LLM on Android device.
☆249 · May 14, 2026 · Updated last week
microsoft / onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
☆1,645 · Feb 24, 2026 · Updated 2 months ago
ARM-software / kleidiai
This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai
☆144 · May 13, 2026 · Updated last week
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆22,633 · May 11, 2026 · Updated last week
alibaba / MNN
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.
☆15,169 · May 12, 2026 · Updated last week
chraac / llama.cpp
LLM inference in C/C++
☆52 · Updated this week
NVIDIA / TensorRT-LLM
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…
☆13,669 · Updated this week
powerserve-project / PowerServe
High-speed and easy-to-use LLM serving framework for local deployment
☆153 · Aug 7, 2025 · Updated 9 months ago
rlghksdbs / CASR
☆17 · Oct 2, 2024 · Updated last year
microsoft / onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
☆20,533 · Updated this week
wangzhaode / mnn-llm
LLM deployment project based on MNN. This project has merged into MNN.
☆1,616 · Jan 20, 2025 · Updated last year
OpenPPL / ppl.nn
A primitive library for neural network
☆1,369 · Nov 24, 2024 · Updated last year
Hozzu / TFLite-aarch64-linux-with-delegate
Deep learning inference SW framework based on TensorFlow Lite for Aarch64 Linux with GPU and Hexagon delegate
☆13 · Mar 11, 2025 · Updated last year
google-ai-edge / ai-edge-quantizer
AI Edge Quantizer: flexible post training quantization for LiteRT models.
☆132 · Updated this week