meta-pytorch / executorch-examplesLinks
Example apps and demos using PyTorch's ExecuTorch framework
☆49Updated last week
Alternatives and similar repositories for executorch-examples
Users that are interested in executorch-examples are comparing it to the libraries listed below
Sorting:
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆350Updated last week
- Supporting PyTorch models with the Google AI Edge TFLite runtime.☆880Updated last week
- 🤗 Optimum ExecuTorch☆93Updated this week
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆84Updated this week
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆1,163Updated this week
- On-device Speech Recognition for Android☆190Updated 2 months ago
- ☆201Updated last week
- Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) an…☆868Updated last week
- Android app for running transformers locally using LLama.cpp & Whisper.cpp☆28Updated last year
- Awesome Mobile LLMs☆284Updated last month
- Embeddings from sentence-transformers in Android! Supports all-MiniLM-L6-V2, bge-small-en, snowflake-arctic, model2vec models and more☆61Updated 3 months ago
- Visualize ONNX models with model-explorer☆66Updated 2 weeks ago
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on…☆73Updated last year
- ☆617Updated this week
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆89Updated 3 weeks ago
- Running any GGUF SLMs/LLMs locally, on-device in Android☆615Updated 3 weeks ago
- Use safetensors with ONNX 🤗☆78Updated 2 months ago
- A Toolkit to Help Optimize Onnx Model☆288Updated this week
- No-code CLI designed for accelerating ONNX workflows☆222Updated 6 months ago
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆403Updated this week
- A custom RAG pipeline for multi-document QA from PDF/DOCX documents, in Android☆157Updated 3 weeks ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆302Updated last year
- Demonstration of running a native LLM on Android device.☆210Updated last week
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆414Updated this week
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆35Updated 5 months ago
- ☆172Updated this week
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆431Updated last week
- Generative AI extensions for onnxruntime☆911Updated this week
- Advanced quantization toolkit for LLMs and VLMs. Support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Schemes and seamless integration with Tra…☆775Updated this week
- Android wrapper for Inference Llama 2 in one file of pure C☆18Updated 2 years ago