meta-pytorch / executorch-examplesLinks
Example apps and demos using PyTorch's ExecuTorch framework
☆53Updated last week
Alternatives and similar repositories for executorch-examples
Users that are interested in executorch-examples are comparing it to the libraries listed below
Sorting:
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆355Updated last month
- Supporting PyTorch models with the Google AI Edge TFLite runtime.☆903Updated this week
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆1,267Updated last week
- 🤗 Optimum ExecuTorch☆101Updated last week
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆88Updated this week
- Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) an…☆890Updated this week
- On-device Speech Recognition for Android☆196Updated 3 months ago
- ☆216Updated this week
- Awesome Mobile LLMs☆290Updated last month
- ☆728Updated this week
- A Toolkit to Help Optimize Onnx Model☆308Updated this week
- Demonstration of running a native LLM on Android device.☆217Updated this week
- Android app for running transformers locally using LLama.cpp & Whisper.cpp☆28Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Updated last month
- Visualize ONNX models with model-explorer☆66Updated last week
- Embeddings from sentence-transformers in Android! Supports all-MiniLM-L6-V2, bge-small-en, snowflake-arctic, model2vec models and more☆63Updated 3 months ago
- ☆177Updated 3 weeks ago
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆270Updated last year
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on…☆74Updated last year
- Fast Multimodal LLM on Mobile Devices☆1,334Updated this week
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆111Updated 3 weeks ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆305Updated last year
- Low-bit LLM inference on CPU/NPU with lookup table☆907Updated 7 months ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆434Updated 3 weeks ago
- Let's use Qualcomm NPU in Android☆17Updated 11 months ago
- Efficient Inference of Transformer models☆478Updated last year
- LLM inference in C/C++☆48Updated this week
- A custom RAG pipeline for multi-document QA from PDF/DOCX documents, in Android☆161Updated 2 weeks ago
- Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high …☆71Updated last month
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆262Updated 11 months ago