zhenbench / z-bench
Z-Bench 1.0 by ZhenFund (真格基金): a Chinese-language LLM test set for "muggles". Z-Bench is an LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team at ZhenFund.
☆487Updated last year
Alternatives and similar repositories for z-bench:
Users that are interested in z-bench are comparing it to the libraries listed below
- ChatGLM-6B instruction learning | instruction data | Instruct☆653Updated last year
- CMMLU: Measuring massive multitask language understanding in Chinese☆717Updated last month
- ☆216Updated last year
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability☆421Updated last year
- Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,670Updated last year
- Alpaca Chinese instruction fine-tuning dataset☆392Updated last year
- unified embedding model☆846Updated last year
- A manually refined Chinese dialogue dataset and a piece of ChatGLM fine-tuning code☆1,166Updated 8 months ago
- Provide OpenAI style API for ChatGLM-6B and Chinese Embeddings Model☆516Updated last year
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆584Updated 3 weeks ago
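- A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)☆355Updated 5 months ago
- Analysis of the Chinese cognitive abilities of language models☆236Updated last year
- Research on evaluating and aligning the values of Chinese large language models☆490Updated last year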
- FlagEval is an evaluation toolkit for AI large foundation models.☆316Updated 6 months ago
- Luotuo Embedding (骆驼嵌入) is a text embedding model developed by 李鲁鲁, 冷子昂, 陈启源, 蒟蒻, et al.☆261Updated last year
- pCLUE: 1000000+ multi-task prompt learning dataset☆476Updated 2 years ago
- Chinese legal LLaMA (LLaMA for the Chinese legal domain)☆885Updated 5 months ago
- ChatGLM2-6B full-parameter fine-tuning, with support for efficient fine-tuning of multi-turn dialogue.☆398Updated last year
- Chinese Mixtral mixture-of-experts (MoE) LLMs☆593Updated 8 months ago
- Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)☆644Updated 5 months ago
- Chinese large language model base generated through incremental pre-training on Chinese datasets☆234Updated last year
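- The Panda project is an open-source overseas Chinese large language model project launched in May 2023, dedicated to exploring the entire technology stack in the era of large models and aiming to promote innovation and collaboration in Chinese natural language processing.☆1,034Updated last year
- ChatGLM on multiple GPUs using DeepSpeed and☆403Updated 6 months ago
- Firefly Chinese LLaMA-2 large model, supporting incremental pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models☆403Updated last year
- 📝 An awesome collection of Chinese legal datasets and relevant resources, dedicated to gathering comprehensive Chinese legal data sources☆810Updated last year
- Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, along with an efficient, lightweight framework for training vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.)☆611Updated last year
- Comprehensive evaluation of Chinese versions of the open-source Llama2 model, based on the SuperCLUE OPEN benchmark | Llama2 Chinese evaluation with SuperCLUE☆127Updated last year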
- ☆180Updated last year
- chatglm 6b finetuning and alpaca finetuning☆1,541Updated 9 months ago
- Exploring the fine-tuning performance of Chinese instruct data on ChatGLM and LLaMA☆390Updated last year