Awesome Mobile LLMs
☆361May 31, 2026Updated last month
Alternatives and similar repositories for awesome-mobile-llm
Users that are interested in awesome-mobile-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- codebase for "MELTing Point: Mobile Evaluation of Language Transformers"☆21Jul 19, 2024Updated last year
- ☆26Jun 2, 2026Updated 3 weeks ago
- Fast Multimodal LLM on Mobile Devices☆1,552Jun 9, 2026Updated 3 weeks ago
- ☆43Mar 29, 2025Updated last year
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆19May 3, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆28May 23, 2026Updated last month
- TinyChatEngine: On-Device LLM Inference Library☆956Jul 4, 2024Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- Low-bit LLM inference on CPU/NPU with lookup table☆966Jun 5, 2025Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- [WACV 2024] Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation☆14Jul 11, 2024Updated last year
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆34Aug 18, 2023Updated 2 years ago
- ☆11May 25, 2026Updated last month
- List Flower resources☆12Feb 4, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 🚗🗣️📡🗾🏁 A framework for navigation tasks that can build the 3D scene graph in real-time and utilize large language model (LLM) to gui…☆28Oct 14, 2024Updated last year
- Typescript parser combinator library☆15Jan 9, 2026Updated 5 months ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, history of ggml-hexagon…☆45Updated this week
- Work in progress LLM framework.☆16Oct 31, 2024Updated last year
- Demonstration of running a native LLM on Android device.☆257Updated this week
- Repository for the AWARE smartphone sensing platform.☆19Nov 25, 2025Updated 7 months ago
- To deploy Transformer models in CV to mobile devices.☆18Jan 20, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆109Oct 2, 2024Updated last year
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,450Apr 30, 2026Updated 2 months ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,258Jun 23, 2025Updated last year
- A curated list for Efficient Large Language Models☆2,019Jun 17, 2025Updated last year
- ☆11Feb 5, 2026Updated 4 months ago
- ☆14Sep 27, 2021Updated 4 years ago
- ☆17Oct 19, 2023Updated 2 years ago
- LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watch…☆865Dec 6, 2025Updated 6 months ago
- ☆35Feb 10, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- llama and other large language models on iOS and MacOS offline using GGML library.☆2,043Jan 30, 2026Updated 5 months ago
- ☆103Jan 17, 2024Updated 2 years ago
- A mobile Implementation of llama.cpp☆327Feb 1, 2024Updated 2 years ago
- Native C/C++ core for ToolNeuron — JNI + llama.cpp bindings for fast, private, on‑device LLM inference on Android.☆30Dec 14, 2025Updated 6 months ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- On-device AI across mobile, embedded and edge for PyTorch☆4,766Updated this week
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 5 years ago