Demonstration of running a native LLM on an Android device.
☆226 · Updated this week
Alternatives and similar repositories for Native-LLM-for-Android
Users who are interested in Native-LLM-for-Android are comparing it to the libraries listed below.
- Demonstration of combining YOLO and depth estimation on an Android device. ☆67 · Nov 15, 2025 · Updated 3 months ago
- Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat ☆90 · Apr 8, 2024 · Updated last year
- Utilizes ONNX Runtime for audio denoising. ☆115 · Dec 27, 2025 · Updated 2 months ago
- Utilizes ONNX Runtime for speech activity detection. ☆42 · Dec 10, 2025 · Updated 2 months ago
- ☆17 · Dec 7, 2023 · Updated 2 years ago
- Utilizes ONNX Runtime to transcribe audio into text. ☆81 · Updated this week
- A simple example of running an mnn-llm language model locally on Android ☆13 · Oct 2, 2025 · Updated 4 months ago
- LLM deployment project based on ONNX. ☆50 · Oct 9, 2024 · Updated last year
- llm-export can export LLM models to ONNX. ☆344 · Oct 24, 2025 · Updated 4 months ago
- Run PyTorch models on Android GPUs with the Vulkan backend ☆10 · Aug 15, 2023 · Updated 2 years ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML ☆12 · Nov 27, 2024 · Updated last year
- Let's use Qualcomm NPU in Android ☆18 · Feb 18, 2025 · Updated last year
- Fast Multimodal LLM on Mobile Devices ☆1,401 · Feb 20, 2026 · Updated last week
- Large Language Model Onnx Inference Framework ☆35 · Nov 25, 2025 · Updated 3 months ago
- Comparison of LLM API performance metrics: an in-depth analysis of key metrics such as TTFT and TPS ☆20 · Sep 12, 2024 · Updated last year
- PersonaTalk Hack ☆15 · Jan 10, 2025 · Updated last year
- Transcribe subtitles and translate them offline with ease. ☆40 · Jan 10, 2026 · Updated last month
- Digital human code based on MuseTalk. ☆35 · Sep 14, 2024 · Updated last year
- Running the F5-TTS by ONNX Runtime standalone with GUI ☆24 · Dec 10, 2024 · Updated last year
- Chinese and English bilingual G2P ☆22 · Jul 16, 2023 · Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK ☆90 · Feb 14, 2026 · Updated 2 weeks ago
- Deploying the DeepSeek-R1 distilled 1.5B model on Android phones ☆23 · Feb 4, 2025 · Updated last year
- mnn asr demo. ☆25 · Mar 24, 2025 · Updated 11 months ago
- ☆23 · Jan 3, 2024 · Updated 2 years ago
- Audio driven video synthesis ☆40 · Aug 11, 2022 · Updated 3 years ago
- A mobile implementation of llama.cpp ☆327 · Feb 1, 2024 · Updated 2 years ago
- Examples of AI models running on boards such as Horizon and Rockchip. ☆21 · Jul 10, 2023 · Updated 2 years ago
- Running the F5-TTS by ONNX Runtime ☆191 · Jan 7, 2026 · Updated last month
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models ☆67 · Sep 22, 2024 · Updated last year
- ☆23 · Jan 5, 2026 · Updated last month
- ☆28 · Jun 30, 2025 · Updated 8 months ago
- An ONNX-based quantization tool. ☆71 · Jan 8, 2024 · Updated 2 years ago
- qwen2 and llama3 cpp implementation ☆49 · Jun 7, 2024 · Updated last year
- OpenAI compatible API for open source LLMs ☆16 · Oct 30, 2023 · Updated 2 years ago
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de… ☆50 · Apr 17, 2024 · Updated last year
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime … ☆125 · Updated this week
- For learning GOT/Qwen/OnnxLLm ☆53 · Oct 8, 2024 · Updated last year
- stable diffusion using mnn ☆67 · Sep 28, 2023 · Updated 2 years ago
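- SAM and lama inpaint with a Qt GUI. Draw points or boxes interactively to run SAM, with results displayed in real time, then run inpainting; see the video in the README for detailed usage. ☆52 · Jan 30, 2024 · Updated 2 years ago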