Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat
☆93 · Apr 8, 2024 · Updated 2 years ago
Alternatives and similar repositories for Qwen1.5-0.5b-chat-android
Users that are interested in Qwen1.5-0.5b-chat-android are comparing it to the libraries listed below.
- mnn asr demo. ☆26 · Mar 24, 2025 · Updated last year
- Recording models ☆12 · Sep 19, 2023 · Updated 2 years ago
- A simple example of running an mnn-llm language model locally on Android ☆13 · Oct 2, 2025 · Updated 6 months ago
- ☆33 · Jul 23, 2024 · Updated last year
- Demonstration of running a native LLM on Android device. ☆243 · Updated this week
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM ☆43 · Oct 20, 2023 · Updated 2 years ago
- LLM deployment project based on MNN. This project has been merged into MNN. ☆1,615 · Jan 20, 2025 · Updated last year
- Whisper in TensorRT-LLM ☆17 · Sep 21, 2023 · Updated 2 years ago
- segment-anything based on MNN ☆37 · Dec 13, 2023 · Updated 2 years ago
- ☆16 · Nov 23, 2022 · Updated 3 years ago
- GPT2⚡NCNN⚡Chinese dialogue⚡x86⚡Android ☆82 · Mar 25, 2022 · Updated 4 years ago
- qwen2 and llama3 cpp implementation ☆50 · Jun 7, 2024 · Updated last year
- YOLOv12 TensorRT end-to-end accelerated model inference and INT8 quantization implementation ☆13 · Mar 5, 2025 · Updated last year
- ☆41 · Oct 8, 2024 · Updated last year
- stable diffusion using mnn ☆67 · Sep 28, 2023 · Updated 2 years ago
- Sample that converts the MobileSAM encoder/decoder to ONNX and runs inference ☆12 · Apr 11, 2024 · Updated 2 years ago
- Let's use Qualcomm NPU in Android ☆18 · Feb 18, 2025 · Updated last year
- ncnn HiFi-GAN ☆29 · Sep 29, 2024 · Updated last year
- ☆34 · Jul 2, 2025 · Updated 9 months ago
- ChineseOcr Lite Mnn: ultra-lightweight Chinese OCR PC demo, using MNN for inference ☆27 · Mar 26, 2021 · Updated 5 years ago
- ☆10 · Jul 18, 2024 · Updated last year
- llm-export can export llm model to onnx. ☆348 · Oct 24, 2025 · Updated 5 months ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP) ☆18 · Apr 7, 2022 · Updated 4 years ago
- HunyuanDiT with TensorRT and libtorch ☆18 · May 22, 2024 · Updated last year
- ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀 ☆25 · Sep 13, 2023 · Updated 2 years ago
- UMatcher: A modern template matching model ☆79 · May 31, 2025 · Updated 10 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files ☆24 · Aug 25, 2023 · Updated 2 years ago
- High Performance AI Model Web Server. Mainly supports computer vision models. Quickly set up your own AI model server. https://github.com/… ☆45 · May 13, 2025 · Updated 11 months ago
- paper-read-notes ☆13 · Sep 26, 2024 · Updated last year
- Deploys yolov5 to Android mobile devices with ncnn; I provide carefully selected, mutually compatible versions that will not raise errors, saving you a lot of time ☆13 · Jul 22, 2022 · Updated 3 years ago
- Deploys DeDoDe with ONNXRuntime: "Detect, Don't Describe; Describe, Don't Detect, for local feature matching". As before, provided in both C++ and Python versions ☆23 · Dec 22, 2023 · Updated 2 years ago
- snpe tutorial ☆10 · Dec 25, 2023 · Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK ☆91 · Feb 14, 2026 · Updated 2 months ago
- Implemented communication across languages. (From C++ to Java). Largely based on https://github.com/nihui/ncnn-android-nanodet/ ☆11 · Oct 1, 2022 · Updated 3 years ago
- QT+NCNN: running YOLOv8s on a Xiaomi phone ☆154 · Feb 3, 2023 · Updated 3 years ago
- Large Language Model Onnx Inference Framework ☆35 · Nov 25, 2025 · Updated 4 months ago
- vits Android deployment ☆350 · Mar 31, 2024 · Updated 2 years ago
- Dockerfiles for poetry/mlc-llm(rk3588)/... ☆10 · Sep 13, 2023 · Updated 2 years ago
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Contest: third-place solution in the preliminary round ☆50 · Aug 16, 2023 · Updated 2 years ago