QwenLM / qwen.cppLinks
C++ implementation of Qwen-LM
☆616Updated last year
Alternatives and similar repositories for qwen.cpp
Users that are interested in qwen.cpp are comparing it to the libraries listed below
Sorting:
- a lightweight LLM model inference framework☆749Updated last year
- 支持中文场景的的小语言模型 llama2.c-zh☆150Updated last year
- Yuan 2.0 Large Language Model☆690Updated last year
- Efficient AI Inference & Serving☆479Updated 2 years ago
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆446Updated last year
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆609Updated last year
- llm deploy project based mnn. This project has merged into MNN.☆1,615Updated last year
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆645Updated last year
- ☆183Updated last week
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆584Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆274Updated 6 months ago
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆655Updated last year
- LLM Inference benchmark☆433Updated last year
- llm-export can export llm model to onnx.☆343Updated 3 months ago
- Open Multilingual Chatbot for Everyone☆1,273Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆140Updated last year
- Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized mo…☆810Updated last year
- This repo contains the data preparation, tokenization, training and inference code for BLOOMChat. BLOOMChat is a 176 billion parameter mu…☆584Updated 2 years ago
- Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! ( non GPU / 5GB vRAM / 8~14GB vRAM)☆541Updated 2 years ago
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.☆121Updated last year
- OrionStar-Yi-34B-Chat 是一款开源中英文Chat模型,由猎户星空基于Yi-34B开源模型、使用15W+高质量语料微调而成。☆264Updated last year
- 360zhinao☆290Updated 8 months ago
- LLaMa/RWKV onnx models, quantization and testcase☆366Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Updated last year
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI☆773Updated 2 years ago
- ggml implementation of the baichuan13b model (adapted from llama.cpp)☆55Updated 2 years ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆264Updated last year
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆416Updated 2 years ago
- CMMLU: Measuring massive multitask language understanding in Chinese☆801Updated last year
- Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sour…☆1,474Updated 11 months ago