DataXujing / Qwen1.5-0.5b-chat-androidLinks
基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat
☆82Updated last year
Alternatives and similar repositories for Qwen1.5-0.5b-chat-android
Users that are interested in Qwen1.5-0.5b-chat-android are comparing it to the libraries listed below
Sorting:
- Demonstration of running a native LLM on Android device.☆161Updated this week
- run ChatGLM2-6B in BM1684X☆49Updated last year
- llm-export can export llm model to onnx.☆301Updated 6 months ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- ☆49Updated 9 months ago
- qwen2 and llama3 cpp implementation☆45Updated last year
- stable diffusion using mnn☆66Updated last year
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆87Updated last year
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆209Updated last year
- simplify >2GB large onnx model☆61Updated 8 months ago
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆22Updated 6 months ago
- GPT2⚡NCNN⚡中文对话⚡x86⚡Android☆80Updated 3 years ago
- ☆90Updated 2 years ago
- Large Language Model Onnx Inference Framework☆36Updated 6 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆77Updated 3 weeks ago
- llm deploy project based onnx.☆42Updated 9 months ago
- 用于学习GOT/Qwen/OnnxLLm☆53Updated 9 months ago
- ncnn android paddle ocr v5☆77Updated 2 months ago
- CLIP⚡NCNN⚡基于自然语言的图片搜索(Image Search)⚡以字搜图⚡x86⚡Android☆259Updated 2 years ago
- 使用OpenCV+onnxruntime部署中文clip做以文搜图,给出一句话来描述想要的图片,就能从图库中搜出来符合要求的图片。包含C++和Python两个版本的程序☆77Updated last year
- 研究GOT-OCR-项目落地加速,不限语言☆61Updated 9 months ago
- an example of segment-anything infer by ncnn☆123Updated 2 years ago
- Compare multiple optimization methods on triton to imporve model service performance☆52Updated last year
- ☆124Updated last year
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- Android paddleocr demo infer by ncnn☆198Updated last year
- GLM Series Edge Models☆146Updated last month
- Run generative AI models in sophgo BM1684X/BM1688☆231Updated last week
- A Toolkit to Help Optimize Large Onnx Model☆157Updated last year
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆50Updated last year