quic / ai-hub-modelsLinks
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
☆855Updated last week
Alternatives and similar repositories for ai-hub-models
Users that are interested in ai-hub-models are comparing it to the libraries listed below
Sorting:
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆345Updated last week
- Supporting PyTorch models with the Google AI Edge TFLite runtime.☆855Updated this week
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆1,102Updated this week
- ☆168Updated 3 weeks ago
- Generative AI extensions for onnxruntime☆901Updated this week
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆381Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆515Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆3,634Updated this week
- TinyChatEngine: On-Device LLM Inference Library☆931Updated last year
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…☆889Updated 2 weeks ago
- Demonstration of running a native LLM on Android device.☆202Updated this week
- Fast Multimodal LLM on Mobile Devices☆1,221Updated last week
- Examples for using ONNX Runtime for machine learning inferencing.☆1,552Updated last week
- Low-bit LLM inference on CPU/NPU with lookup table☆898Updated 6 months ago
- LLM inference in C/C++☆48Updated this week
- ☆339Updated 2 years ago
- Conversion of PyTorch Models into TFLite☆397Updated 2 years ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆430Updated this week
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse…☆1,605Updated this week
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆300Updated last year
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …☆92Updated this week
- A pytorch quantization backend for optimum☆1,012Updated 2 weeks ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,513Updated this week
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.☆2,196Updated last week
- Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high …☆70Updated this week
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆427Updated this week
- Advanced quantization toolkit for LLMs and VLMs. Native support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Schemes and seamless integration wi…☆753Updated this week
- A Toolkit to Help Optimize Onnx Model☆267Updated last week
- ☆533Updated this week
- llm-export can export llm model to onnx.☆334Updated last month