open-compass / GAOKAO-Eval
☆99Updated last month
Alternatives and similar repositories for GAOKAO-Eval:
Users who are interested in GAOKAO-Eval are comparing it to the repositories listed below:
- ☆80Updated last year
- LingoWhale-8B: Open Bilingual LLMs | Open-source bilingual pretrained large language model☆135Updated last year
- ☆216Updated last year
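- SuperCLUE Langya Leaderboard (琅琊榜): an anonymous head-to-head evaluation benchmark for general-purpose Chinese large language models☆143Updated 6 months ago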
- AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics…☆169Updated 5 months ago
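- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆67Updated 4 months ago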
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆110Updated last year
- backend for fastnlp MOSS project☆60Updated 6 months ago
- 😜 A visual dataset of memes, annotated using the image-understanding capabilities of glm-4v and step-1v.☆107Updated 8 months ago
- Gaokao Benchmark for AI☆105Updated 2 years ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆40Updated this week
- As the name suggests: a hand-built RAG☆116Updated 10 months ago
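- [In progress, entry by entry] A QA dataset of selected questions from Ruozhiba (弱智吧), with every entry manually reviewed and revised☆121Updated this week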
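- Uses free large-model APIs together with your private-domain data to generate SFT training data (entirely free of charge); supports the training data formats of tools such as llamafactory. Synthetic data.☆116Updated last month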
- A Python Package to Access World-Class Generative Models☆126Updated 7 months ago
- Light local website for displaying performances from different chat models.☆85Updated last year
- ☆444Updated last year
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆194Updated 2 weeks ago
- The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆259Updated 8 months ago
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆580Updated last week
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆133Updated 9 months ago
- Alpaca Chinese Dataset -- Chinese instruction fine-tuning dataset [human + GPT4o, continuously updated]☆193Updated 3 months ago
- ChatGLM-6B-Slim: ChatGLM-6B with the 20K image tokens pruned; identical performance with lower GPU memory usage.☆126Updated last year
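- BayLing (百聆) is a LLaMA-based English/Chinese large language model enhanced with language alignment. It has strong English/Chinese capabilities and reaches 90% of ChatGPT's performance on multilingual and general-task tests. BayLing is an English/Chinese LLM equipped with advanced l…☆307Updated last month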
- A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)☆353Updated 5 months ago
- 2024 Alibaba Global Mathematics Competition AI Track Global 2nd Place Project (Agent Universe)☆51Updated 7 months ago
- ☆221Updated 8 months ago
- Chinese tokens in tiktoken tokenizers.☆30Updated 8 months ago
- ☆125Updated last year
- Evaluation for AI apps and agents☆36Updated last year