pytorch-labs / executorch-examples
Example apps and demos using PyTorch's ExecuTorch framework
☆11, updated last week
Alternatives and similar repositories for executorch-examples
Users who are interested in executorch-examples are comparing it to the libraries listed below.
- 🤗 Optimum ExecuTorch (☆53, updated last week)
- On-device Speech Recognition for Android (☆104, updated this week)
- FlashAttention (Metal Port) (☆497, updated 9 months ago)
- LiteRT continues the legacy of TensorFlow Lite as the trusted, high-performance runtime for on-device AI. Now with LiteRT Next, we're exp… (☆595, updated this week)
- 1.58 Bit LLM on Apple Silicon using MLX (☆214, updated last year)
- llama.cpp fork with additional SOTA quants and improved performance (☆608, updated this week)
- No-code CLI designed for accelerating ONNX workflows (☆198, updated 2 weeks ago)
- Awesome Mobile LLMs (☆204, updated 3 weeks ago)
- Reference implementation of the backend for llama.cpp on an Android phone equipped with Qualcomm's Hexagon NPU; details can be seen at http… (☆23, updated this week)
- A minimalistic C++ Jinja templating engine for LLM chat templates (☆156, updated last month)
- Fast Matrix Multiplications for Lookup Table-Quantized LLMs (☆371, updated 2 months ago)
- Universal cross-platform tokenizers binding to HF and sentencepiece (☆350, updated this week)
- Supporting PyTorch models with the Google AI Edge TFLite runtime. (☆678, updated this week)
- Fast Hadamard transform in CUDA, with a PyTorch interface (☆201, updated last year)
- Demonstration of combining YOLO and depth estimation on an Android device. (☆51, updated last month)
- (☆137, updated this week)
- (☆213, updated 5 months ago)
- Local LLM Server with GPU and NPU Acceleration (☆138, updated last week)
- AI Tensor Engine for ROCm (☆208, updated this week)
- Thin wrapper around GGML to make life easier (☆35, updated 3 weeks ago)
- AI Edge Quantizer: flexible post training quantization for LiteRT models. (☆49, updated this week)
- A ggml (C++) re-implementation of tortoise-tts (☆187, updated 10 months ago)
- A proxy that hosts multiple single-model runners such as llama.cpp and vLLM (☆11, updated 3 weeks ago)
- Run transformers (incl. LLMs) on the Apple Neural Engine. (☆61, updated last year)
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai (☆51, updated last week)
- A simple Flash Attention v2 implementation with ROCm (RDNA3 GPU, rocWMMA), mainly used for Stable Diffusion (ComfyUI) in Windows ZLUDA en… (☆43, updated 10 months ago)
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python. (☆360, updated this week)
- LLM Inference on consumer devices (☆119, updated 3 months ago)
- Implementation of Mamba in Rust (☆87, updated last year)
- A safetensors extension to efficiently store sparse quantized tensors on disk (☆129, updated this week)