quic / ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
☆839Updated this week
Alternatives and similar repositories for ai-hub-models
Users interested in ai-hub-models are comparing it to the libraries listed below.
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆338Updated last week
- Supporting PyTorch models with the Google AI Edge TFLite runtime.☆828Updated this week
- LiteRT, successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆943Updated this week
- ☆166Updated last week
- Generative AI extensions for onnxruntime☆878Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆3,507Updated this week
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆371Updated this week
- ☆338Updated last year
- A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. …☆1,529Updated this week
- Examples for using ONNX Runtime for machine learning inferencing.☆1,530Updated this week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,496Updated this week
- Demonstration of running a native LLM on an Android device.☆195Updated last week
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆424Updated this week
- Fast Multimodal LLM on Mobile Devices☆1,183Updated this week
- Low-bit LLM inference on CPU/NPU with lookup table☆887Updated 5 months ago
- A toolkit to help optimize ONNX models☆236Updated last week
- TinyChatEngine: On-Device LLM Inference Library☆923Updated last year
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆507Updated this week
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, …☆2,525Updated this week
- Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high …☆67Updated 3 months ago
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …☆84Updated last week
- LLM inference in C/C++☆46Updated this week
- Advanced quantization toolkit for LLMs. Native support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Bits and seamless integration with Transform…☆712Updated this week
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…☆881Updated 3 weeks ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆95Updated this week
- ☆477Updated this week
- A PyTorch quantization backend for Optimum☆1,009Updated 3 weeks ago
- ONNX Optimizer☆772Updated 2 weeks ago
- Demonstration of combining YOLO and depth estimation on an Android device.☆59Updated this week
- A parser, editor and profiler tool for ONNX models.☆465Updated 2 weeks ago