meta-pytorch / executorch-examplesLinks
Example apps and demos using PyTorch's ExecuTorch framework
☆59Updated this week
Alternatives and similar repositories for executorch-examples
Users that are interested in executorch-examples are comparing it to the libraries listed below
Sorting:
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆369Updated last week
- Support PyTorch model conversion with LiteRT.☆930Updated this week
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆1,399Updated this week
- 🤗 Optimum ExecuTorch☆106Updated last week
- Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) an…☆909Updated last week
- On-device Speech Recognition for Android☆201Updated 2 weeks ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆96Updated this week
- ☆786Updated this week
- Running any GGUF SLMs/LLMs locally, on-device in Android☆661Updated last month
- ☆237Updated this week
- Awesome Mobile LLMs☆301Updated 2 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Updated this week
- Visualize ONNX models with model-explorer☆67Updated 3 weeks ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆306Updated last year
- Demonstration of running a native LLM on Android device.☆226Updated this week
- A Toolkit to Help Optimize Onnx Model☆383Updated last week
- TTS support with GGML☆218Updated 4 months ago
- Use safetensors with ONNX 🤗☆84Updated 3 weeks ago
- Generative AI extensions for onnxruntime☆953Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆4,226Updated this week
- Low-bit LLM inference on CPU/NPU with lookup table☆916Updated 8 months ago
- Embeddings from sentence-transformers in Android! Supports all-MiniLM-L6-V2, bge-small-en, snowflake-arctic, model2vec models and more☆67Updated 4 months ago
- 🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza…☆839Updated this week
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆428Updated this week
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆271Updated last year
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆35Updated 6 months ago
- Fast Multimodal LLM on Mobile Devices☆1,370Updated this week
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆113Updated last week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,406Updated 9 months ago
- LLM inference in C/C++☆48Updated this week