open-compass / GAOKAO-EvalLinks
☆109Updated 6 months ago
Alternatives and similar repositories for GAOKAO-Eval
Users that are interested in GAOKAO-Eval are comparing it to the libraries listed below
Sorting:
- ☆84Updated last year
- Gaokao Benchmark for AI☆108Updated 2 years ago
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆122Updated last year
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263Updated last year
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆113Updated 2 years ago
- 顾名思义:手搓的RAG☆124Updated last year
- TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.☆149Updated 8 months ago
- AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics…☆192Updated 10 months ago
- ☆224Updated last year
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆52Updated this week
- Deep Reasoning Translation (DRT) Project☆224Updated 3 weeks ago
- SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准☆144Updated last year
- ☆228Updated last year
- “百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced l…☆316Updated 6 months ago
- 使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略☆29Updated last month
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆213Updated 4 months ago
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆167Updated 7 months ago
- Alpaca Chinese Dataset -- 中文指令微调数据集☆208Updated 8 months ago
- LingoWhale-8B: Open Bilingual LLMs | 开源双语预训练大模型☆139Updated last year
- Light local website for displaying performances from different chat models.☆87Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69Updated 2 years ago
- 我们是第一个完全可商用的角色大模型。☆40Updated 10 months ago
- FlagEval is an evaluation toolkit for AI large foundation models.☆337Updated 2 months ago
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆659Updated 5 months ago
- ☆128Updated 2 years ago
- 【逐条处理完成】人为审核+修改每一条的弱智吧精选问题QA数据集☆208Updated 2 months ago
- backend for fastnlp MOSS project☆59Updated 11 months ago
- A Python Package to Access World-Class Generative Models☆127Updated last year
- 面向中文大模型价值观的评估与对齐研究☆522Updated last year
- 文本去重☆72Updated last year