Awesome Mobile LLMs
☆332Apr 6, 2026Updated 3 weeks ago
Alternatives and similar repositories for awesome-mobile-llm
Users that are interested in awesome-mobile-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- codebase for "MELTing Point: Mobile Evaluation of Language Transformers"☆19Jul 19, 2024Updated last year
- ☆26Nov 10, 2025Updated 5 months ago
- Fast Multimodal LLM on Mobile Devices☆1,484Updated this week
- ☆27Oct 25, 2023Updated 2 years ago
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆19May 3, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- TinyChatEngine: On-Device LLM Inference Library☆953Jul 4, 2024Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- Low-bit LLM inference on CPU/NPU with lookup table☆953Jun 5, 2025Updated 10 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆33Aug 18, 2023Updated 2 years ago
- List Flower resources☆12Feb 4, 2022Updated 4 years ago
- (WACV'24) Kaizen: Practical self-supervised continual learning with continual fine-tuning☆16Oct 29, 2024Updated last year
- Typescript parser combinator library☆15Jan 9, 2026Updated 3 months ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13May 11, 2023Updated 2 years ago
- Repository for the AWARE smartphone sensing platform.☆17Nov 25, 2025Updated 5 months ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆20Jun 19, 2025Updated 10 months ago
- To deploy Transformer models in CV to mobile devices.☆18Jan 20, 2022Updated 4 years ago
- ☆108Oct 2, 2024Updated last year
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,428Apr 21, 2025Updated last year
- Flutter / Dart bindings for llama.cpp☆20Sep 30, 2023Updated 2 years ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,259Jun 23, 2025Updated 10 months ago
- A curated list for Efficient Large Language Models☆1,993Jun 17, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Feb 5, 2026Updated 2 months ago
- Strong and Open Vision Language Assistant for Mobile Devices☆1,350Apr 15, 2024Updated 2 years ago
- ☆14Sep 27, 2021Updated 4 years ago
- ☆17Oct 19, 2023Updated 2 years ago
- ☆35Feb 10, 2025Updated last year
- Swift package for reading and writing Safetensors files.☆13Feb 6, 2026Updated 2 months ago
- llama and other large language models on iOS and MacOS offline using GGML library.☆2,021Jan 30, 2026Updated 3 months ago
- ☆102Jan 17, 2024Updated 2 years ago
- A mobile Implementation of llama.cpp☆327Feb 1, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Native C/C++ core for ToolNeuron — JNI + llama.cpp bindings for fast, private, on‑device LLM inference on Android.☆31Dec 14, 2025Updated 4 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆47Feb 13, 2025Updated last year
- An Android App recreating the Simon Says game. Uses MediaPipe to run an LLM on device☆22Sep 18, 2025Updated 7 months ago
- Demonstration of running a native LLM on Android device.☆246Apr 12, 2026Updated 2 weeks ago
- ☆50Aug 6, 2024Updated last year
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆404Updated this week
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆2,301Updated this week