Awesome Mobile LLMs
☆352May 31, 2026Updated last week
Alternatives and similar repositories for awesome-mobile-llm
Users that are interested in awesome-mobile-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- codebase for "MELTing Point: Mobile Evaluation of Language Transformers"☆20Jul 19, 2024Updated last year
- Fast Multimodal LLM on Mobile Devices☆1,532Apr 30, 2026Updated last month
- ☆43Mar 29, 2025Updated last year
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆19May 3, 2024Updated 2 years ago
- ☆27May 23, 2026Updated 2 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TinyChatEngine: On-Device LLM Inference Library☆953Jul 4, 2024Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- Low-bit LLM inference on CPU/NPU with lookup table☆965Jun 5, 2025Updated last year
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆34Aug 18, 2023Updated 2 years ago
- List Flower resources☆12Feb 4, 2022Updated 4 years ago
- 🚗🗣️📡🗾🏁 A framework for navigation tasks that can build the 3D scene graph in real-time and utilize large language model (LLM) to gui…☆27Oct 14, 2024Updated last year
- Typescript parser combinator library☆15Jan 9, 2026Updated 5 months ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- Demonstration of running a native LLM on Android device.☆255Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,444Apr 30, 2026Updated last month
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,260Jun 23, 2025Updated 11 months ago
- A curated list for Efficient Large Language Models☆2,018Jun 17, 2025Updated 11 months ago
- ☆11Feb 5, 2026Updated 4 months ago
- Strong and Open Vision Language Assistant for Mobile Devices☆1,355Apr 15, 2024Updated 2 years ago
- LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watch…☆863Dec 6, 2025Updated 6 months ago
- Swift package for reading and writing Safetensors files.☆13Feb 6, 2026Updated 4 months ago
- llama and other large language models on iOS and MacOS offline using GGML library.☆2,037Jan 30, 2026Updated 4 months ago
- A mobile Implementation of llama.cpp☆328Feb 1, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Native C/C++ core for ToolNeuron — JNI + llama.cpp bindings for fast, private, on‑device LLM inference on Android.☆30Dec 14, 2025Updated 5 months ago
- Calling LLM APIs on a Raspberry Pi for lulz☆24Apr 17, 2023Updated 3 years ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- On-device AI across mobile, embedded and edge for PyTorch☆4,716Updated this week
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆50May 12, 2026Updated 3 weeks ago
- ☆52Aug 6, 2024Updated last year
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆420Updated this week
- a lightweight LLM model inference framework☆753Apr 7, 2024Updated 2 years ago
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆2,511Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Multi-Agent LLM System for Digital Scam Protection☆15Dec 19, 2024Updated last year
- paper and its code for AI System☆363May 14, 2026Updated 3 weeks ago
- Curated list of methods that focuses on improving the efficiency of diffusion models☆43Jul 9, 2024Updated last year
- UE5 plugin for singleton management of interfaces, actors, and components☆14Mar 19, 2025Updated last year
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆15Apr 25, 2024Updated 2 years ago
- 🤗 Optimum ExecuTorch☆131May 26, 2026Updated 2 weeks ago