DataXujing / Qwen1.5-0.5b-chat-android
基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat
☆77Updated last year
Alternatives and similar repositories for Qwen1.5-0.5b-chat-android
Users that are interested in Qwen1.5-0.5b-chat-android are comparing it to the libraries listed below
Sorting:
- run ChatGLM2-6B in BM1684X☆49Updated last year
- Demonstration of running a native LLM on Android device.☆136Updated this week
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆86Updated 9 months ago
- stable diffusion using mnn☆68Updated last year
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- llm-export can export llm model to onnx.☆289Updated 3 months ago
- GPT2⚡NCNN⚡中文对话⚡x86⚡Android☆79Updated 3 years ago
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆188Updated last year
- qwen2 and llama3 cpp implementation☆44Updated 11 months ago
- ☆90Updated last year
- Compare multiple optimization methods on triton to imporve model service performance☆50Updated last year
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆21Updated 3 months ago
- ☆124Updated last year
- ☆44Updated 6 months ago
- ☆27Updated 6 months ago
- Large Language Model Onnx Inference Framework☆33Updated 4 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆64Updated last week
- Android paddleocr demo infer by ncnn☆194Updated 9 months ago
- simplify >2GB large onnx model☆56Updated 5 months ago
- ncnn android yolov8 realtime detection, segmentation, pose estimation, classification and obb☆88Updated last week
- export llama to onnx☆124Updated 4 months ago
- ☆32Updated 9 months ago
- an example of segment-anything infer by ncnn☆121Updated 2 years ago
- llm deploy project based onnx.☆36Updated 7 months ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆18Updated 2 months ago
- ppstructure deploy by ncnn☆32Updated 10 months ago
- A converter for llama2.c legacy models to ncnn models.☆87Updated last year
- 使用Android cpu 运行 RWKV V4 ONNX☆70Updated last year
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆47Updated last year
- 研究GOT-OCR-项目落地加速,不限语言☆60Updated 6 months ago