chraac / llama-cpp-qnn-builderLinks
☆11Updated last month
Alternatives and similar repositories for llama-cpp-qnn-builder
Users that are interested in llama-cpp-qnn-builder are comparing it to the libraries listed below
Sorting:
- LLM inference in C/C++☆48Updated this week
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆88Updated this week
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生 成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆227Updated last year
- llm deploy project based onnx.☆47Updated last year
- llm-export can export llm model to onnx.☆333Updated last month
- Demonstration of running a native LLM on Android device.☆202Updated this week
- stable diffusion using mnn☆67Updated 2 years ago
- 基于MNN-llm的安卓手 机部署大语言模型:Qwen1.5-0.5B-Chat☆85Updated last year
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …☆92Updated this week
- A Toolkit to Help Optimize Onnx Model☆267Updated this week
- ncnn android paddle ocr v5☆127Updated last month
- Run generative AI models in sophgo BM1684X/BM1688☆254Updated this week
- The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) a…☆345Updated this week
- ☆1,096Updated 2 weeks ago
- Inference RWKV with multiple supported backends.☆70Updated this week
- Large Language Model Onnx Inference Framework☆36Updated last week
- A repo for llm on ncnn☆60Updated last week
- Yolov12 model supports android deployment.☆126Updated 5 months ago
- Port of Facebook's LLaMA model in C/C++☆103Updated last week
- A converter for llama2.c legacy models to ncnn models.☆80Updated last year
- My develoopment fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆110Updated 3 weeks ago
- PyTorch Neural Network eXchange☆656Updated 2 weeks ago
- Run Chinese MobileBert model on SNPE.☆15Updated 2 years ago
- Port of Facebook's LLaMA model in C/C++☆64Updated 7 months ago
- Explore LLM model deployment based on AXera's AI chips☆130Updated last week
- 一个模块化,全过程可离线,低占用率的对话机器人/智能音箱☆121Updated 9 months ago
- A Toolkit to Help Optimize Large Onnx Model☆162Updated last month
- ☆84Updated 2 years ago
- Run Large Language Models on RK3588 with GPU-acceleration☆117Updated 2 years ago
- ☆125Updated last year