wangzhaode / mnn-llmLinks
llm deploy project based mnn. This project has merged into MNN.
☆1,604Updated 8 months ago
Alternatives and similar repositories for mnn-llm
Users that are interested in mnn-llm are comparing it to the libraries listed below
Sorting:
- a lightweight LLM model inference framework☆738Updated last year
- fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tp…☆3,982Updated last week
- C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)☆2,975Updated last year
- TigerBot: A multi-language multi-task LLM☆2,257Updated 9 months ago
- C++ implementation of Qwen-LM☆607Updated 10 months ago
- 计图大模型推理库,具有高性能、配置要求低、中文支持好、可移植等特点☆2,429Updated 7 months ago
- A 13B large language model developed by Baichuan Intelligent Technology☆2,963Updated 2 years ago
- Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集