Demonstration of running a native LLM on an Android device.
☆257Jun 25, 2026Updated this week
Alternatives and similar repositories for Native-LLM-for-Android
Users that are interested in Native-LLM-for-Android are comparing it to the libraries listed below.
- Demonstration of combining YOLO and depth estimation on an Android device.☆71Nov 15, 2025Updated 7 months ago
- Utilizes ONNX Runtime for audio denoising.☆132Jun 6, 2026Updated 3 weeks ago
- Transcribe subtitles and translate them offline with ease.☆45Jun 15, 2026Updated 2 weeks ago
- Deploying a large language model on Android phones based on MNN-llm: Qwen1.5-0.5B-Chat☆97Apr 8, 2024Updated 2 years ago
- ☆18Dec 7, 2023Updated 2 years ago
- Utilizes ONNX Runtime for a TTS model.☆64Updated this week
- Let's use Qualcomm NPU in Android☆20Feb 18, 2025Updated last year
- Run PyTorch models on Android GPUs with the Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆282Feb 1, 2025Updated last year
- LLM deployment project based on ONNX.☆49Oct 9, 2024Updated last year
- llm-export can export LLM models to ONNX.☆352May 8, 2026Updated last month
- A basic startup guide on running LLMs on Android locally or using an external Ollama server☆38Jan 31, 2024Updated 2 years ago
- Fast Multimodal LLM on Mobile Devices☆1,552Jun 9, 2026Updated 3 weeks ago
- Running the F5-TTS by ONNX Runtime standalone with GUI☆26Dec 10, 2024Updated last year
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 7 months ago
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …☆176Updated this week
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Memory Cleaner, Phone Booster and Optimizer.☆10Nov 20, 2018Updated 7 years ago
- Deploying the 1.5B model distilled from DeepSeek-R1 on Android phones☆24Feb 4, 2025Updated last year
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆19May 3, 2024Updated 2 years ago
- Running the F5-TTS by ONNX Runtime☆202Jun 5, 2026Updated 3 weeks ago
- ☆23Jan 3, 2024Updated 2 years ago
- LLM deployment project based on MNN. This project has been merged into MNN.☆1,617Jan 20, 2025Updated last year
- Based on mlc-llm, a personal attempt to deploy and run large models on Android phones☆90Aug 5, 2024Updated last year
- The repository supports TensorRT, QNN platform inference, 2D obstacle detection yolo series (yolov5, yolov8, yolo11, yolox), semantic seg…☆20May 6, 2025Updated last year
- ☆18Mar 28, 2024Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆94Jun 8, 2026Updated 3 weeks ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Sep 22, 2024Updated last year
- A mobile implementation of llama.cpp☆327Feb 1, 2024Updated 2 years ago
- Stable Diffusion using MNN☆68Sep 28, 2023Updated 2 years ago
- Support PyTorch model conversion with LiteRT.☆1,048Updated this week
- Pre-training Llama 3 using Chinese☆13May 1, 2024Updated 2 years ago
- ☆33Jul 23, 2024Updated last year
- Chinese and English Bilingual G2P☆22Jul 16, 2023Updated 2 years ago
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for Chinese speech…☆21Mar 16, 2023Updated 3 years ago
- ☆28Jun 30, 2025Updated last year
- ☆17Oct 16, 2023Updated 2 years ago
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, …☆1,750Oct 20, 2025Updated 8 months ago
- Voice Craft is a desktop AI assistance tool designed to help people with disabilities operate a computer using their voice. This tool can…☆18May 23, 2023Updated 3 years ago