haonan-li / CMMLULinks

CMMLU: Measuring massive multitask language understanding in Chinese

☆774

Alternatives and similar repositories for CMMLU

Users that are interested in CMMLU are comparing it to the libraries listed below

Sorting:

hkust-nlp / ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
☆1,760Updated last week
OpenLMLab / GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
☆673Updated 6 months ago
THUDM / AlignBench
大模型多维度中文对齐评测基准 (ACL 2024)
☆401Updated 11 months ago
chaoswork / sft_datasets
开源SFT数据集整理,随时补充
☆529Updated 2 years ago
X-PLUG / CValues
面向中文大模型价值观的评估与对齐研究
☆530Updated 2 years ago
beyondguo / LLM-Tuning
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
☆1,008Updated last year
hikariming / chat-dataset-baseline
人工精调的中文对话数据集和一段chatglm的微调代码
☆1,184Updated 3 months ago
yangjianxin1 / Firefly-LLaMA2-Chinese
Firefly中文LLaMA-2大模型，支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型
☆410Updated last year
HIT-SCIR / huozi
活字通用大模型
☆393Updated 10 months ago
jianzhnie / LLamaTuner
Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.
☆609Updated 6 months ago
flageval-baai / FlagEval
FlagEval is an evaluation toolkit for AI large foundation models.
☆339Updated 3 months ago
michael-wzhu / Chinese-LlaMA2
Repo for adapting Meta LlaMA2 in Chinese! META最新发布的LlaMA2的汉化版！（完全开源可商用）
☆742Updated last year
wangyuxinwhy / uniem
unified embedding model
☆864Updated last year
IEIT-Yuan / Yuan-2.0
Yuan 2.0 Large Language Model
☆689Updated last year
FlagOpen / FlagData
☆348Updated last year
SkyworkAI / Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sour…
☆1,409Updated 4 months ago
yanqiangmiffy / InstructGLM
ChatGLM-6B 指令学习|指令数据|Instruct
☆652Updated 2 years ago
zhenbench / z-bench
Z-Bench 1.0 by 真格基金：一个麻瓜的大语言模型中文测试集。Z-Bench is a LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team…
☆496Updated 2 years ago
ymcui / Chinese-Mixtral
中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）
☆604Updated last year
xverse-ai / XVERSE-13B
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
☆645Updated last year
shuxueslpi / chatGLM-6B-QLoRA
使用peft库，对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调，并做lora model和base model的merge及4bit的量化（quantize）。
☆361Updated last year
AndrewZhe / lawyer-llama
中文法律LLaMA (LLaMA for Chinese legel domain)
☆954Updated 11 months ago
zhihaiLLM / wisdomInterrogatory
☆520Updated last year
HIT-SCIR / Chinese-Mixtral-8x7B
中文Mixtral-8x7B（Chinese-Mixtral-8x7B）
☆650Updated 11 months ago
FlagAI-Open / Aquila2
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
☆444Updated 9 months ago
airaria / Visual-Chinese-LLaMA-Alpaca
多模态中文LLaMA&Alpaca大语言模型（VisualCLA）
☆451Updated 2 years ago
liangwq / Chatglm_lora_multi-gpu
chatglm多gpu用deepspeed和
☆407Updated last year
Tencent / TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
☆1,078Updated 11 months ago
charent / Phi2-mini-Chinese
Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.
☆559Updated last year
Duxiaoman-DI / XuanYuan
轩辕：度小满中文金融对话大模型
☆1,246Updated 6 months ago