Awesome Mobile LLMs
☆322Apr 6, 2026Updated this week
Alternatives and similar repositories for awesome-mobile-llm
Users who are interested in awesome-mobile-llm are comparing it to the repositories listed below.
- Fast Multimodal LLM on Mobile Devices☆1,457Mar 29, 2026Updated last week
- ☆27Oct 25, 2023Updated 2 years ago
- ☆43Mar 29, 2025Updated last year
- ☆18Mar 25, 2026Updated 2 weeks ago
- TinyChatEngine: On-Device LLM Inference Library☆949Jul 4, 2024Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- Low-bit LLM inference on CPU/NPU with lookup table☆944Jun 5, 2025Updated 10 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆33Aug 18, 2023Updated 2 years ago
- List Flower resources☆12Feb 4, 2022Updated 4 years ago
- ☆11Mar 31, 2026Updated last week
- (WACV'24) Kaizen: Practical self-supervised continual learning with continual fine-tuning☆16Oct 29, 2024Updated last year
- 🚗🗣️📡🗾🏁 A framework for navigation tasks that can build the 3D scene graph in real-time and utilize large language model (LLM) to gui…☆25Oct 14, 2024Updated last year
- TypeScript parser combinator library☆15Jan 9, 2026Updated 3 months ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆18May 3, 2024Updated last year
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆38Jul 14, 2025Updated 8 months ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆19Jun 19, 2025Updated 9 months ago
- Work in progress LLM framework.☆15Oct 31, 2024Updated last year
- ☆13May 11, 2023Updated 2 years ago
- To deploy Transformer models in CV to mobile devices.☆18Jan 20, 2022Updated 4 years ago
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,420Apr 21, 2025Updated 11 months ago
- Flutter / Dart bindings for llama.cpp☆20Sep 30, 2023Updated 2 years ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,257Jun 23, 2025Updated 9 months ago
- Strong and Open Vision Language Assistant for Mobile Devices☆1,350Apr 15, 2024Updated last year
- Learn and build GPU RTL from scratch☆20Aug 1, 2025Updated 8 months ago
- ☆14Sep 27, 2021Updated 4 years ago
- ☆35Feb 10, 2025Updated last year
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆47Feb 13, 2025Updated last year
- Calling LLM APIs on a Raspberry Pi for lulz☆24Apr 17, 2023Updated 2 years ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 4 years ago
- Llama and other large language models on iOS and macOS offline using the GGML library.☆2,008Jan 30, 2026Updated 2 months ago
- An Android App recreating the Simon Says game. Uses MediaPipe to run an LLM on device☆22Sep 18, 2025Updated 6 months ago
- Demonstration of running a native LLM on Android device.☆240Mar 30, 2026Updated last week
- ☆50Aug 6, 2024Updated last year
- a lightweight LLM model inference framework☆751Apr 7, 2024Updated 2 years ago
- React hook to run pyodide in a web worker☆12Jan 29, 2025Updated last year
- 🤗 Optimum ExecuTorch☆122Apr 2, 2026Updated last week