DataXujing / Qwen1.5-0.5b-chat-androidLinks
基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat
☆88Updated last year
Alternatives and similar repositories for Qwen1.5-0.5b-chat-android
Users that are interested in Qwen1.5-0.5b-chat-android are comparing it to the libraries listed below
Sorting:
- Demonstration of running a native LLM on Android device.☆210Updated last week
- ncnn android paddle ocr v5☆131Updated 2 months ago
- qwen2 and llama3 cpp implementation☆49Updated last year
- stable diffusion using mnn☆67Updated 2 years ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆229Updated last year
- GPT2⚡NCNN⚡中文对话⚡x86⚡Android☆82Updated 3 years ago
- llm-export can export llm model to onnx.☆337Updated 2 months ago
- llm deploy project based onnx.☆48Updated last year
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Updated 2 years ago
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆22Updated 10 months ago
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆91Updated last year
- 使用OpenCV+onnxruntime部署中文clip做以文搜图,给出一句话来描述想要的图片,就能从图库中搜出来符合要求的图片。包含C++和Python两个版本的程序☆85Updated last year
- Large Language Model Onnx Inference Framework☆36Updated last month
- ppstructure deploy by ncnn☆34Updated last year
- ☆54Updated last year
- CLIP⚡NCNN⚡基于自然语言的图片搜索(Image Search)⚡以字搜图⚡x86⚡Android☆278Updated 2 years ago
- Android paddleocr demo infer by ncnn☆211Updated last year
- ☆90Updated 2 years ago
- Compare multiple optimization methods on triton to imporve model service performance☆52Updated last year
- Inference deployment of the llama3☆11Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆89Updated 3 weeks ago
- an example of segment-anything infer by ncnn☆124Updated 2 years ago
- 基于rknn的官方Android项目rknn_yolov5_android_apk_demo进行修改,部署人脸检测模型retinaface和106人脸关键点检测模型,支持实时人脸检测。支持rk356x和rk3588设备npu推理。☆25Updated last year
- ☆43Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆19Updated last year
- 用于学习GOT/Qwen/OnnxLLm☆53Updated last year
- A high performance, high expansion, easy to use framework for AI application. 为AI应用的开发者提供一套统一的高性能、易用的编程框架,快速基于AI全栈服务、开发跨端边云的AI行业应用,支持GPU,…☆160Updated last year
- simplify >2GB large onnx model☆70Updated last year
- ☆33Updated last year