Demonstration of running a native LLM on Android device.
☆249May 14, 2026Updated last week
Alternatives and similar repositories for Native-LLM-for-Android
Users that are interested in Native-LLM-for-Android are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Utilizes ONNX Runtime for speech activity detection.☆44Dec 10, 2025Updated 5 months ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- Utilizes ONNX Runtime for audio denoising.☆124Dec 27, 2025Updated 4 months ago
- Transcribe subtitles and translate them offline with ease.☆44Jan 10, 2026Updated 4 months ago
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆95Apr 8, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Utilizes ONNX Runtime for TTS model.☆62Updated this week
- Let's use Qualcomm NPU in Android☆20Feb 18, 2025Updated last year
- Run pytorch models on GPU Android with Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆278Feb 1, 2025Updated last year
- Android本地运行mnn-llm语言模型简单示例☆13Oct 2, 2025Updated 7 months ago
- llm deploy project based onnx.☆49Oct 9, 2024Updated last year
- llm-export can export llm model to onnx.☆352May 8, 2026Updated last week
- Export the STFT or ISTFT process in ONNX format.☆43Mar 16, 2026Updated 2 months ago
- Fast Multimodal LLM on Mobile Devices☆1,508Apr 30, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- mnn asr demo.☆27Mar 24, 2025Updated last year
- Running the F5-TTS by ONNX Runtime standalone with GUI☆25Dec 10, 2024Updated last year
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 5 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆24Feb 4, 2025Updated last year
- A tutorial for runing LLM in Andriod Termux with Vulkan GPU acceleration☆16Jan 27, 2025Updated last year
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆19May 3, 2024Updated 2 years ago
- Running the F5-TTS by ONNX Runtime☆197Apr 28, 2026Updated 3 weeks ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆41Mar 18, 2026Updated 2 months ago
- ☆23Jan 3, 2024Updated 2 years ago
- llm deploy project based mnn. This project has merged into MNN.☆1,616Jan 20, 2025Updated last year
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆90Aug 5, 2024Updated last year
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- The repository supports TensorRT, QNN platform inference, 2D obstacle detection yolo series (yolov5, yolov8, yolo11, yolox), semantic seg…☆20May 6, 2025Updated last year
- ☆18Mar 28, 2024Updated 2 years ago
- segment-anything based mnn☆37Dec 13, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Updated this week
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Sep 22, 2024Updated last year
- A mobile Implementation of llama.cpp☆328Feb 1, 2024Updated 2 years ago
- stable diffusion using mnn☆68Sep 28, 2023Updated 2 years ago
- Support PyTorch model conversion with LiteRT.☆1,022Updated this week
- pre-training llama3 using chinese☆13May 1, 2024Updated 2 years ago
- ☆33Jul 23, 2024Updated last year