QwenLM / qwen.cpp
C++ implementation of Qwen-LM
☆550 · Updated 10 months ago
Related projects
Alternatives and complementary repositories for qwen.cpp
- Chinese Mixtral MoE LLMs (Chinese Mixtral mixture-of-experts large language models) ☆581 · Updated 6 months ago
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models. ☆436 · Updated 3 weeks ago
- a lightweight LLM model inference framework ☆699 · Updated 7 months ago
- Chinese-Mixtral-8x7B ☆641 · Updated 2 months ago
- Efficient AI Inference & Serving ☆456 · Updated 10 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including … ☆135 · Updated 2 months ago
- Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for plugging into langchain to load a local knowledge base for retrieval-augmented generation (RAG). ☆482 · Updated 3 months ago
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking ☆231 · Updated this week
- CMMLU: Measuring massive multitask language understanding in Chinese ☆696 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆121 · Updated 10 months ago
- Yuan 2.0 Large Language Model ☆681 · Updated 3 months ago
- LLM Inference benchmark ☆349 · Updated 3 months ago
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc. ☆650 · Updated 7 months ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning" ☆257 · Updated 6 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications. ☆541 · Updated 3 weeks ago
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models. ☆546 · Updated 7 months ago
- llama2.c-zh: a small language model that supports Chinese-language scenarios ☆143 · Updated 8 months ago
- Mixture-of-Experts (MoE) Language Model ☆180 · Updated 2 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ☆1,000 · Updated 9 months ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability ☆421 · Updated last year
- Tongyi Qianwen (Qwen) VLLM inference deployment demo ☆437 · Updated 7 months ago
- Efficient 4-bit QLoRA fine-tuning of chatGLM-6B/chatGLM2-6B using the peft library, plus merging of the LoRA model with the base model and 4-bit quantization. ☆354 · Updated last year
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023] ☆1,633 · Updated last year
- Firefly Chinese LLaMA-2 large models, with support for continued pretraining of large models such as Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, and Bloom ☆396 · Updated last year
- unified embedding model ☆828 · Updated last year
- Alpaca Chinese Dataset: a Chinese instruction fine-tuning dataset (human-curated + GPT4o, continuously updated) ☆184 · Updated last month
- 🩹 Editing large language models within 10 seconds ⚡ ☆1,281 · Updated last year
- Collection of Chinese Books ☆173 · Updated 10 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ☆764 · Updated 10 months ago
- 360zhinao ☆280 · Updated last month