quic / ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices.
☆603 · Updated last week
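For orientation, the repository's documented usage pattern is that each model ships as a Python module with a pretrained loader. A minimal sketch, assuming `qai_hub_models` is installed (`pip install qai_hub_models`) and that the `mobilenet_v2` model module exists in the installed release (module names may vary by version):

```python
# Minimal sketch: load a deployment-ready model from qai_hub_models.
# Assumes `pip install qai_hub_models` and that the mobilenet_v2 module
# is present in the installed release (module names may vary by version).
import torch
from qai_hub_models.models.mobilenet_v2 import Model

model = Model.from_pretrained()  # fetch pretrained weights
model.eval()

# Dummy forward pass; the expected input shape comes from the model's docs.
x = torch.rand(1, 3, 224, 224)
with torch.no_grad():
    y = model(x)
print(y.shape)
```

Per the repo's README, each model module also exposes an export entry point (e.g. `python -m qai_hub_models.models.mobilenet_v2.export`) for compiling and profiling the model on real devices through Qualcomm AI Hub.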
Alternatives and similar repositories for ai-hub-models:
Users interested in ai-hub-models are comparing it to the libraries listed below.
- ☆124 · Updated 2 months ago
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) a… ☆140 · Updated 3 weeks ago
- Supporting PyTorch models with the Google AI Edge TFLite runtime. ☆457 · Updated this week
- LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-… ☆276 · Updated this week
- Generative AI extensions for onnxruntime ☆620 · Updated this week
- On-device AI across mobile, embedded and edge for PyTorch ☆2,526 · Updated this week
- Run generative AI models with a simple C++/Python API using OpenVINO Runtime ☆220 · Updated this week
- onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime ☆360 · Updated this week
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv… ☆750 · Updated this week
- TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillati… ☆715 · Updated this week
- Examples for using ONNX Runtime for machine learning inferencing. ☆1,300 · Updated 3 weeks ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools ☆443 · Updated this week
- ☆312 · Updated last year
- Qualcomm Cloud AI SDK (Platform and Apps) enables high-performance deep learning inference on Qualcomm Cloud AI platforms delivering high … ☆54 · Updated 3 months ago
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆976 · Updated this week
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024). ☆1,250 · Updated last week
- Conversion of PyTorch Models into TFLite ☆369 · Updated last year
- Advanced Quantization Algorithm for LLMs/VLMs. ☆372 · Updated this week
- Official implementation of Half-Quadratic Quantization (HQQ) ☆748 · Updated this week
- A PyTorch quantization backend for Optimum ☆883 · Updated last month
- An innovative library for efficient LLM inference via low-bit quantization ☆351 · Updated 5 months ago
- Common utilities for ONNX converters ☆259 · Updated 2 months ago
- Low-bit LLM inference on CPU with lookup table ☆687 · Updated last month
- Efficient Inference of Transformer models ☆422 · Updated 6 months ago
- Universal cross-platform tokenizers binding to HF and sentencepiece ☆305 · Updated 2 weeks ago
- This repository contains tutorials and examples for Triton Inference Server ☆648 · Updated this week
- Fast Multimodal LLM on Mobile Devices ☆696 · Updated last week
- A tool to modify ONNX models visually, based on Netron and Flask. ☆1,423 · Updated 2 weeks ago
- Actively maintained ONNX Optimizer ☆672 · Updated 3 weeks ago
- A text-to-image project based on the open-source Stable Diffusion V1.5 model; it produces models that can run on a phone's CPU and NPU, along with a companion model-execution framework. ☆141 · Updated 10 months ago