Awesome Mobile LLMs
☆313 · Mar 20, 2026 · Updated this week
Alternatives and similar repositories for awesome-mobile-llm
Users who are interested in awesome-mobile-llm compare it to the repositories listed below.
- codebase for "MELTing Point: Mobile Evaluation of Language Transformers" ☆18 · Jul 19, 2024 · Updated last year
- ☆27 · Oct 25, 2023 · Updated 2 years ago
- ☆43 · Mar 29, 2025 · Updated 11 months ago
- ☆13 · Feb 7, 2026 · Updated last month
- TinyChatEngine: On-Device LLM Inference Library ☆945 · Jul 4, 2024 · Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆17 · Mar 13, 2023 · Updated 3 years ago
- Low-bit LLM inference on CPU/NPU with lookup table ☆932 · Jun 5, 2025 · Updated 9 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs) ☆12 · Mar 27, 2024 · Updated last year
- [WACV 2024] Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation ☆14 · Jul 11, 2024 · Updated last year
- ☆10 · Jun 18, 2024 · Updated last year
- List Flower resources ☆12 · Feb 4, 2022 · Updated 4 years ago
- (WACV'24) Kaizen: Practical self-supervised continual learning with continual fine-tuning ☆16 · Oct 29, 2024 · Updated last year
- 🚗🗣️📡🗾🏁 A framework for navigation tasks that can build the 3D scene graph in real-time and utilize large language model (LLM) to gui… ☆24 · Oct 14, 2024 · Updated last year
- Typescript parser combinator library ☆15 · Jan 9, 2026 · Updated 2 months ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23) ☆14 · Nov 1, 2023 · Updated 2 years ago
- Android app for the Hole in your Palm project, making LLMs accessible on-device! ☆18 · May 3, 2024 · Updated last year
- Work in progress LLM framework. ☆15 · Oct 31, 2024 · Updated last year
- ☆13 · May 11, 2023 · Updated 2 years ago
- Deploying Transformer models for computer vision (CV) on mobile devices. ☆18 · Jan 20, 2022 · Updated 4 years ago
- ☆104 · Oct 2, 2024 · Updated last year
- On-device LLM Inference Powered by X-Bit Quantization ☆305 · Mar 2, 2026 · Updated 2 weeks ago
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,417 · Apr 21, 2025 · Updated 10 months ago
- [TMLR 2024] Efficient Large Language Models: A Survey ☆1,256 · Jun 23, 2025 · Updated 8 months ago
- A curated list for Efficient Large Language Models ☆1,967 · Jun 17, 2025 · Updated 9 months ago
- ☆11 · Feb 5, 2026 · Updated last month
- Strong and Open Vision Language Assistant for Mobile Devices ☆1,345 · Apr 15, 2024 · Updated last year
- Learn and build GPU RTL from scratch ☆20 · Aug 1, 2025 · Updated 7 months ago
- ☆14 · Sep 27, 2021 · Updated 4 years ago
- ☆35 · Feb 10, 2025 · Updated last year
- ☆17 · Oct 19, 2023 · Updated 2 years ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton ☆44 · Feb 13, 2025 · Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading" [MobiCom'2022] ☆19 · Aug 4, 2022 · Updated 3 years ago
- ☆48 · Aug 6, 2024 · Updated last year
- Demonstration of running a native LLM on Android device. ☆236 · Mar 14, 2026 · Updated last week
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a… ☆387 · Mar 13, 2026 · Updated last week
- A lightweight LLM inference framework ☆747 · Apr 7, 2024 · Updated last year
- Artifacts for ATC '22 paper "Faster Software Packet Processing on FPGA NICs with eBPF Program Warping" ☆17 · May 20, 2022 · Updated 3 years ago
- Papers and their code for AI systems ☆355 · Feb 10, 2026 · Updated last month
- 🤗 Optimum ExecuTorch ☆121 · Updated this week