open-compass / GAOKAO-Eval
☆104Updated 4 months ago
Alternatives and similar repositories for GAOKAO-Eval:
Users who are interested in GAOKAO-Eval are comparing it to the repositories listed below
- Gaokao Benchmark for AI☆108Updated 2 years ago
- ☆221Updated last year
- ☆83Updated last year
- LingoWhale-8B: Open Bilingual LLMs | Open-source bilingual pre-trained large language model☆138Updated last year
- SuperCLUE Langya Leaderboard (琅琊榜): an anonymous head-to-head evaluation benchmark for Chinese general-purpose large models☆145Updated 9 months ago
- 😜 A visual dataset of memes, annotated using the image-understanding capabilities of glm-4v and step-1v.☆119Updated 11 months ago
- As the name suggests: a hand-rolled RAG☆121Updated last year
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆50Updated last week
- ☆225Updated 11 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆212Updated 3 months ago
- ☆62Updated 2 months ago
- AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics…☆183Updated 8 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68Updated last year
- Light local website for displaying performances from different chat models.☆86Updated last year
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆632Updated 3 months ago
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆113Updated last year
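- TeleChat2, the Xingchen (星辰) semantic large model, is a large language model developed and trained by the China Telecom Artificial Intelligence Research Institute; it is the first open-source model at the hundred-billion-parameter scale trained entirely on domestic Chinese compute☆227Updated 3 weeks ago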
- ☆143Updated 9 months ago
- Chinese large language model evaluation: 2024 Gaokao mathematics special topic☆17Updated 10 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆124Updated 10 months ago
- The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆260Updated 11 months ago
- ☆128Updated last year
- Research on evaluating and aligning the values of Chinese large language models☆505Updated last year
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆55Updated last year
- ☆46Updated 10 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆186Updated last month
- HanFei-1.0 (韩非): China's first legal large language model trained with full-parameter training☆116Updated last year
- ☆122Updated last year
- SOTA open-source math LLM☆332Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆131Updated 10 months ago