DakeQQ / Native-LLM-for-Android
Demonstration of running a native LLM on an Android device.
☆129Updated last week
Alternatives and similar repositories for Native-LLM-for-Android:
Users who are interested in Native-LLM-for-Android are comparing it to the libraries listed below.
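- Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat☆76Updated last year
- A personal attempt, modeled on mlc-llm, to deploy and run a large model on an Android phone☆86Updated 8 months ago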
- llm-export can export LLM models to ONNX.☆282Updated 3 months ago
- Stable Diffusion inference on an Android phone's CPU☆151Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆62Updated last week
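- A text-to-image project that builds on the open-source Stable Diffusion V1.5 model to produce models that run on a phone's CPU and NPU, along with the accompanying model runtime framework.☆153Updated last year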
- Run Chinese MobileBert model on SNPE.☆14Updated last year
- MiniCPM on Android platform.☆629Updated last month
- Stable Diffusion using MNN☆68Updated last year
- ☆29Updated last year
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆180Updated 2 weeks ago
- A Toolkit to Help Optimize Onnx Model☆140Updated this week
- Port of Facebook's LLaMA model in C/C++☆92Updated this week
- Simplify large (>2GB) ONNX models☆55Updated 4 months ago
- ☆33Updated last year
- ☆124Updated last year
- Android app for running transformers locally using LLama.cpp & Whisper.cpp☆26Updated 10 months ago
- LLM deployment project based on ONNX.☆36Updated 6 months ago
- Demonstration of combining YOLO and depth estimation on an Android device.☆46Updated 3 weeks ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆42Updated last year
- Run RWKV V4 ONNX on an Android CPU☆70Updated last year
- Run ChatGLM2-6B on BM1684X☆49Updated last year
- A converter for llama2.c legacy models to ncnn models.☆87Updated last year
- Large Language Model Onnx Inference Framework☆32Updated 3 months ago
- ☆32Updated 9 months ago
- Export LLaMA to ONNX☆121Updated 3 months ago
- ☆84Updated 2 years ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆243Updated this week
- GLM Series Edge Models☆136Updated 2 months ago
- A Toolkit to Help Optimize Large Onnx Model☆153Updated 11 months ago