Awesome Mobile LLMs
☆343May 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for awesome-mobile-llm
Users that are interested in awesome-mobile-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- codebase for "MELTing Point: Mobile Evaluation of Language Transformers"☆19Jul 19, 2024Updated last year
- ☆26Nov 10, 2025Updated 6 months ago
- Fast Multimodal LLM on Mobile Devices☆1,508Apr 30, 2026Updated 3 weeks ago
- ☆43Mar 29, 2025Updated last year
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆19May 3, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆26May 2, 2026Updated 2 weeks ago
- TinyChatEngine: On-Device LLM Inference Library☆952Jul 4, 2024Updated last year
- Low-bit LLM inference on CPU/NPU with lookup table☆955Jun 5, 2025Updated 11 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- ☆11Mar 31, 2026Updated last month
- (WACV'24) Kaizen: Practical self-supervised continual learning with continual fine-tuning☆16Oct 29, 2024Updated last year
- 🚗🗣️📡🗾🏁 A framework for navigation tasks that can build the 3D scene graph in real-time and utilize large language model (LLM) to gui…☆26Oct 14, 2024Updated last year
- Typescript parser combinator library☆15Jan 9, 2026Updated 4 months ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Work in progress LLM framework.☆15Oct 31, 2024Updated last year
- To deploy Transformer models in CV to mobile devices.☆18Jan 20, 2022Updated 4 years ago
- ☆109Oct 2, 2024Updated last year
- On-device LLM Inference Powered by X-Bit Quantization☆312Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,438Apr 30, 2026Updated 3 weeks ago
- Flutter / Dart bindings for llama.cpp☆20Sep 30, 2023Updated 2 years ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,258Jun 23, 2025Updated 10 months ago
- ☆11Feb 5, 2026Updated 3 months ago
- Strong and Open Vision Language Assistant for Mobile Devices☆1,353Apr 15, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆14Sep 27, 2021Updated 4 years ago
- ☆17Oct 19, 2023Updated 2 years ago
- ☆35Feb 10, 2025Updated last year
- llama and other large language models on iOS and MacOS offline using GGML library.☆2,026Jan 30, 2026Updated 3 months ago
- ☆102Jan 17, 2024Updated 2 years ago
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 4 years ago
- Demonstration of running a native LLM on Android device.☆249Updated this week
- ☆50Aug 6, 2024Updated last year
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆409Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via e…☆2,392May 14, 2026Updated last week
- A command-line interface tool that converts natural language instructions into shell commands using OpenAI's GPT-4.☆21Mar 11, 2025Updated last year
- a lightweight LLM model inference framework☆752Apr 7, 2024Updated 2 years ago
- Artifacts for ATC '22 paper "Faster Software Packet Processing on FPGA NICs with eBPF Program Warping"☆17May 20, 2022Updated 4 years ago
- paper and its code for AI System☆362May 14, 2026Updated last week
- A lightweight server for LightGBM☆15Oct 16, 2020Updated 5 years ago
- A direct convolution library targeting ARM multi-core CPUs.☆12Nov 27, 2024Updated last year