tc-mb / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆108Updated last week
Alternatives and similar repositories for llama.cpp
Users who are interested in llama.cpp are comparing it to the libraries listed below.
- Port of Facebook's LLaMA model in C/C++☆67Updated 9 months ago
- GLM Series Edge Models☆158Updated 7 months ago
- Exploring deployment acceleration for the GOT-OCR project, in any language☆62Updated last year
- ☆242Updated 11 months ago
- Cook up amazing multimodal AI applications effortlessly with MiniCPM-o☆290Updated this week
- Demonstration of running a native LLM on Android device.☆226Updated this week
- Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat☆89Updated last year
- ☆341Updated 3 months ago
- Explore LLM model deployment based on AXera's AI chips☆139Updated this week
- ☆31Updated last year
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆32Updated last week
- MiniCPM on Android platform.☆636Updated 10 months ago
- stable diffusion using mnn☆67Updated 2 years ago
- qwen2 and llama3 cpp implementation☆49Updated last year
- ☆55Updated last year
- Run generative AI models in sophgo BM1684X/BM1688☆266Updated 2 weeks ago
- llm-export can export LLM models to ONNX.☆343Updated 3 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆299Updated 7 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆81Updated last year
- Transformer framework for edge computing based on C++.☆130Updated last year
- C++ implementation of Qwen-LM☆616Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Updated this week
- xllamacpp - a Python wrapper of llama.cpp☆73Updated this week
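- A text-to-image project that, based on the open-source Stable Diffusion V1.5 model, produces models that can run on a phone's CPU and NPU, along with the accompanying model runtime framework.☆231Updated last year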
- run ChatGLM2-6B in BM1684X☆49Updated last year
- LLM deployment project based on ONNX.☆49Updated last year
- run chatglm3-6b in BM1684X☆39Updated last year
- ☆125Updated 2 years ago
- Inference RWKV with multiple supported backends.☆77Updated this week
- ☆379Updated last year