quic / ai-hub-models
Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices.
☆890 · Updated this week
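As a quick orientation before the comparison list, here is a minimal sketch of pulling a pretrained model out of the `qai_hub_models` Python package and running it locally. The `mobilenet_v2` module path and the `Model.from_pretrained()` entry point follow the pattern shown in the repository README; treat the exact names as assumptions to check against the current docs.

```python
# Minimal sketch, assuming `pip install qai-hub-models` and the
# Model.from_pretrained() pattern from the repo README.
import torch
from qai_hub_models.models.mobilenet_v2 import Model  # module path per README pattern

model = Model.from_pretrained()        # downloads pretrained, device-optimized weights
model.eval()

example = torch.rand(1, 3, 224, 224)   # standard MobileNet-v2 input shape
with torch.no_grad():
    logits = model(example)            # plain PyTorch forward pass on the host
print(logits.shape)                    # e.g. torch.Size([1, 1000]) for ImageNet classes
```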
Alternatives and similar repositories for ai-hub-models
Users interested in ai-hub-models are comparing it to the libraries listed below.
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) a… ☆355 · Updated last month
- Supporting PyTorch models with the Google AI Edge TFLite runtime (see the conversion sketch after this list). ☆903 · Updated this week
- ☆177 · Updated 3 weeks ago
- LiteRT, successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms, via e… ☆1,289 · Updated this week
- Generative AI extensions for onnxruntime. ☆930 · Updated this week
- On-device AI across mobile, embedded and edge for PyTorch. ☆4,126 · Updated this week
- Run Generative AI models with a simple C++/Python API using the OpenVINO Runtime. ☆414 · Updated this week
- Examples for using ONNX Runtime for machine learning inference (see the inference sketch after this list). ☆1,584 · Updated last week
- ☆340 · Updated 2 years ago
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv… ☆909 · Updated last week
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime … ☆102 · Updated this week
- ☆728 · Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools. ☆528 · Updated this week
- A toolkit to help optimize ONNX models. ☆308 · Updated last week
- Demonstration of running a native LLM on an Android device. ☆217 · Updated this week
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse… ☆1,805 · Updated this week
- Fast Multimodal LLM on Mobile Devices. ☆1,349 · Updated this week
- onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime. ☆434 · Updated last month
- Inference Vision Transformer (ViT) in plain C/C++ with ggml. ☆305 · Updated last year
- Neural Network Compression Framework for enhanced OpenVINO™ inference. ☆1,115 · Updated this week
- Conversion of PyTorch Models into TFLite. ☆398 · Updated 2 years ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models. ☆2,537 · Updated this week
- ☆1,190 · Updated last month
- TinyChatEngine: On-Device LLM Inference Library. ☆939 · Updated last year
- Low-bit LLM inference on CPU/NPU with lookup table. ☆907 · Updated 7 months ago
- The Qualcomm Cloud AI SDK (Platform and Apps) enables high-performance deep learning inference on Qualcomm Cloud AI platforms, delivering high … ☆71 · Updated last month
- Efficient Inference of Transformer models. ☆478 · Updated last year
- PyTorch Neural Network eXchange. ☆665 · Updated last week
- A PyTorch quantization backend for Optimum. ☆1,021 · Updated last month
- 🎯 An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza… ☆815 · Updated this week
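The ai-edge-torch entry above ("Supporting PyTorch models with the Google AI Edge TFLite runtime") is easiest to grasp from its conversion flow. A minimal sketch, assuming the `convert()`/`export()` quickstart API from that project's README; the torchvision model is just a stand-in for any `torch.nn.Module`:

```python
import torch
import torchvision
import ai_edge_torch  # assumes `pip install ai-edge-torch`

# Any torch.nn.Module works; torchvision's MobileNet-v2 is a stand-in here.
model = torchvision.models.mobilenet_v2(weights=None).eval()
sample_args = (torch.randn(1, 3, 224, 224),)

# convert() traces the model against the sample inputs and lowers it to a
# TFLite/LiteRT flatbuffer; export() writes the .tflite file to disk.
edge_model = ai_edge_torch.convert(model, sample_args)
edge_model.export("mobilenet_v2.tflite")
```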
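Likewise, the ONNX Runtime examples entry reduces to the standard `InferenceSession` pattern, sketched below; `model.onnx` and the input shape are hypothetical placeholders for whatever model you exported.

```python
import numpy as np
import onnxruntime as ort

# "model.onnx" is a hypothetical path; any exported ONNX model works the same way.
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])

input_name = session.get_inputs()[0].name               # first graph input
x = np.random.rand(1, 3, 224, 224).astype(np.float32)   # dummy input matching the model

# run(None, ...) returns every graph output as a list of numpy arrays.
outputs = session.run(None, {input_name: x})
print(outputs[0].shape)
```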