DataXujing / Qwen1.5-0.5b-chat-android

Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat

☆82

Alternatives and similar repositories for Qwen1.5-0.5b-chat-android

Users who are interested in Qwen1.5-0.5b-chat-android are comparing it to the repositories listed below

DakeQQ / Native-LLM-for-Android
Demonstration of running a native LLM on an Android device.
☆161 · Updated this week
sophgo / ChatGLM2-TPU
Run ChatGLM2-6B on BM1684X
☆49 · Updated last year
wangzhaode / llm-export
llm-export can export LLM models to ONNX.
☆301 · Updated 6 months ago
FeiGeChuanShu / trt2023
NVIDIA TensorRT Hackathon 2023 second-round project: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM
☆42 · Updated last year
Tlntin / qwen-ascend-llm
☆49 · Updated 9 months ago
yvonwin / qwen2.cpp
qwen2 and llama3 cpp implementation
☆45 · Updated last year
wangzhaode / mnn-stable-diffusion
stable diffusion using mnn
☆66 · Updated last year
TroyTzou / mlc-llm-android
Based on mlc-llm; a personal attempt at deploying and running large models on Android phones
☆87 · Updated last year
XiaoMi / StableDiffusionOnDevice
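A text-to-image project. Starting from the open-source Stable Diffusion V1.5 model, it produces models that can run on a phone's CPU and NPU, together with the accompanying model runtime framework.
☆209 · Updated last year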
luchangli03 / onnxsim_large_model
simplify >2GB large onnx model
☆61 · Updated 8 months ago
DataXujing / DeepSeek-R1-Android
Deploying the 1.5B model distilled from DeepSeek-R1 on Android phones
☆22 · Updated 6 months ago
EdVince / GPT2-ChineseChat-NCNN
GPT2⚡NCNN⚡Chinese chat⚡x86⚡Android
☆80 · Updated 3 years ago
Tlntin / ChatGLM2-6B-TensorRT
☆90 · Updated 2 years ago
inisis / OnnxLLM
Large Language Model Onnx Inference Framework
☆36 · Updated 6 months ago
MollySophia / rwkv-qualcomm
Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK
☆77 · Updated 3 weeks ago
wangzhaode / onnx-llm
LLM deployment project based on ONNX.
☆42 · Updated 9 months ago
BaofengZan / GOT-OCRv2-onnx
For learning GOT/Qwen/OnnxLLm
☆53 · Updated 9 months ago
nihui / ncnn-android-ppocrv5
ncnn android paddle ocr v5
☆77 · Updated 2 months ago
EdVince / CLIP-ImageSearch-NCNN
CLIP⚡NCNN⚡Natural-language image search⚡Find images by text⚡x86⚡Android
☆259 · Updated 2 years ago
hpc203 / Chinese-CLIP-opencv-onnxrun
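Deploys Chinese CLIP with OpenCV + onnxruntime for text-to-image search: given a sentence describing the desired image, it retrieves matching images from the gallery. Includes both C++ and Python versions.
☆77 · Updated last year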
1694439208 / GOT-OCR-Inference
Exploring production deployment and acceleration of GOT-OCR, in any programming language
☆61 · Updated 9 months ago
FeiGeChuanShu / segment-anything-ncnn
An example of segment-anything inference with ncnn
☆123 · Updated 2 years ago
bug-developer021 / YOLOV5_optimization_on_triton
Compare multiple optimization methods on Triton to improve model service performance
☆52 · Updated last year
daquexian / faster-rwkv
☆124 · Updated last year
DataXujing / TensorRT-LLM-ChatGLM3
Large-model deployment in practice: TensorRT-LLM, Triton Inference Server, vLLM
☆26 · Updated last year
FeiGeChuanShu / ncnn_paddleocr
Android PaddleOCR demo with inference by ncnn
☆198 · Updated last year
zai-org / GLM-Edge
GLM Series Edge Models
☆146 · Updated last month
sophgo / LLM-TPU
Run generative AI models in sophgo BM1684X/BM1688
☆231 · Updated last week
tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆157 · Updated last year
TRT2022 / trtllm-llama
☢️ TensorRT 2023 second round: accelerating and optimizing Llama model inference with TensorRT-LLM
☆50 · Updated last year