quic / ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices.
☆559 · Updated last week
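For a feel of how these packaged models are used, here is a minimal sketch of loading one and running a local forward pass in PyTorch. It follows the per-model package pattern from the repo's README; the specific model name (`mobilenet_v2`) and the 224×224 input shape are illustrative assumptions, not guaranteed details.

```python
# Minimal sketch: load a pretrained model from qai_hub_models and run it locally.
# Assumes `pip install qai_hub_models`; mobilenet_v2 and the input shape are
# illustrative assumptions following the repo's per-model package layout.
import torch
from qai_hub_models.models.mobilenet_v2 import Model

model = Model.from_pretrained()        # fetches pretrained weights
model.eval()                           # the wrapper is a torch.nn.Module
dummy = torch.rand(1, 3, 224, 224)     # NCHW image batch
with torch.no_grad():
    logits = model(dummy)
print(logits.shape)                    # e.g. torch.Size([1, 1000])
```

Compiling and profiling on real Qualcomm devices goes through the separate Qualcomm AI Hub service; the snippet above only exercises the local PyTorch definition.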
Alternatives and similar repositories for ai-hub-models:
Users interested in ai-hub-models are comparing it to the repositories listed below.
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices. ☆105 · Updated last month
- Supporting PyTorch models with the Google AI Edge TFLite runtime (see the conversion sketch after this list). ☆423 · Updated this week
- TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, and distillation. ☆667 · Updated last week
- LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-device AI (see the interpreter sketch after this list). ☆229 · Updated this week
- Generative AI extensions for onnxruntime ☆581 · Updated this week
- ☆119 · Updated last month
- On-device AI across mobile, embedded and edge for PyTorch ☆2,407 · Updated this week
- Qualcomm Cloud AI SDK (Platform and Apps) enables high-performance deep learning inference on Qualcomm Cloud AI platforms, delivering high throughput and low latency. ☆55 · Updated 2 months ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools ☆430 · Updated this week
- Low-bit LLM inference on CPU with lookup table ☆646 · Updated last week
- Self-created tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow. ☆735 · Updated last month
- Run generative AI models with a simple C++/Python API using the OpenVINO Runtime ☆198 · Updated this week
- onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime ☆349 · Updated this week
- A parser, editor, and profiler tool for ONNX models. ☆411 · Updated last week
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024). ☆1,217 · Updated last month
- Advanced Quantization Algorithm for LLMs/VLMs. ☆344 · Updated this week
- ☆1,025 · Updated last year
- Run generative AI models on Sophgo BM1684X ☆152 · Updated this week
- Vision Transformer (ViT) inference in plain C/C++ with ggml ☆244 · Updated 9 months ago
- TinyChatEngine: On-Device LLM Inference Library ☆792 · Updated 6 months ago
- Common utilities for ONNX converters ☆256 · Updated last month
- LLaMA/RWKV ONNX models, quantization, and test cases ☆356 · Updated last year
- ☆311 · Updated last year
- Strong and Open Vision Language Assistant for Mobile Devices ☆1,106 · Updated 9 months ago
- Official implementation of Half-Quadratic Quantization (HQQ) ☆732 · Updated this week
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python. ☆304 · Updated this week
- Demonstration of running a native LLM on an Android device. ☆103 · Updated 3 weeks ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆226 · Updated 3 months ago
- Fast Multimodal LLM on Mobile Devices ☆661 · Updated last week
- Universal cross-platform tokenizer bindings to HF and SentencePiece ☆297 · Updated 2 months ago
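The ai-edge-torch entry above (PyTorch models on the Google AI Edge TFLite runtime) is easiest to grasp from its convert-and-export flow. A minimal sketch, assuming the `ai_edge_torch.convert()`/`export()` API that the project's README documents:

```python
# Sketch: convert a PyTorch model to a .tflite flatbuffer with ai-edge-torch.
# Assumes `pip install ai-edge-torch`; API names follow that project's README.
import torch
import torchvision
import ai_edge_torch

resnet18 = torchvision.models.resnet18(
    weights=torchvision.models.ResNet18_Weights.IMAGENET1K_V1
).eval()
sample_inputs = (torch.randn(1, 3, 224, 224),)    # inputs used for tracing

edge_model = ai_edge_torch.convert(resnet18, sample_inputs)
_ = edge_model(*sample_inputs)                    # validate via the TFLite runtime
edge_model.export("resnet18.tflite")              # serialize for deployment
```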
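Once a `.tflite` file exists (say, the one exported in the previous sketch), the LiteRT entry above boils down to the long-standing TFLite interpreter loop. This sketch uses the classic `tf.lite.Interpreter` API, which LiteRT keeps compatible:

```python
# Sketch: run a .tflite model with the TFLite/LiteRT interpreter.
# "resnet18.tflite" is the file exported in the previous sketch.
import numpy as np
import tensorflow as tf

interpreter = tf.lite.Interpreter(model_path="resnet18.tflite")
interpreter.allocate_tensors()

inp = interpreter.get_input_details()[0]   # input tensor metadata
out = interpreter.get_output_details()[0]  # output tensor metadata

x = np.random.rand(*inp["shape"]).astype(inp["dtype"])  # random batch
interpreter.set_tensor(inp["index"], x)
interpreter.invoke()
logits = interpreter.get_tensor(out["index"])
print(logits.shape)
```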