llmeval/Llmeval-Gaokao2024-Math

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/llmeval/Llmeval-Gaokao2024-Math)

llmeval / Llmeval-Gaokao2024-Math

LLM evaluation on 2024 Chinese Gaokao Mathematics — zero-contamination benchmark with dual prompt formats

☆21

Alternatives and similar repositories for Llmeval-Gaokao2024-Math

Users that are interested in Llmeval-Gaokao2024-Math are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

llmeval / LLMEval-Fair
View on GitHub
[ACL 2026] A large-scale longitudinal study on robust and fair evaluation of LLMs — 200K+ generative questions across 13 disciplines
☆40May 21, 2026Updated 2 months ago
lauhaide / clads
View on GitHub
XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…
☆10Nov 4, 2022Updated 3 years ago
NoSyu / VHUCM
View on GitHub
Implementation of Variational Hierarchical User-based Conversation Model
☆10Jul 2, 2021Updated 5 years ago
XL2248 / SOV-MAS
View on GitHub
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11May 16, 2023Updated 3 years ago
tsuruoka-lab / AMI-Meeting-Parallel-Corpus
View on GitHub
AMI Meeting Parallel Corpus
☆13Dec 11, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenLMLab / GAOKAO-Bench-Updates
View on GitHub
GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.
☆48Jan 7, 2025Updated last year
XL2248 / CPCC
View on GitHub
Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"
☆12Dec 17, 2021Updated 4 years ago
korokes / MCLS
View on GitHub
Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos
☆10Sep 2, 2024Updated last year
GussailRaat / EMNLP-19-IIM
View on GitHub
Context-aware-Interactive-Attention-for-Multi-modal-Sentiment-and Emotion-Analysis
☆11Feb 24, 2021Updated 5 years ago
KongLongGeFDU / TransferTOD
View on GitHub
The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"
☆20May 12, 2026Updated 2 months ago
shreydesai / hurricane
View on GitHub
Code and datasets for the ACL 2020 paper "Detecting Perceived Emotions in Hurricane Disasters"
☆12Oct 4, 2022Updated 3 years ago
zhongpeixiang / affect-rich-conversational-model
View on GitHub
The PyTorch code for paper: An Affect-Rich Neural Conversational Model with Biased Attention and Weighted Cross-Entropy Loss
☆12Oct 7, 2019Updated 6 years ago
YLXDXX / AM601-kaoyan
View on GitHub
中国科学院大学，601高等数学甲，历年考研真题收集整理
☆13Aug 4, 2025Updated 11 months ago
IsakZhang / XABSA
View on GitHub
☆10Nov 29, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
fandongmeng / DTMT_InDec
View on GitHub
Implementation of DTMT with incremental decoding
☆13Feb 20, 2021Updated 5 years ago
thunlp / SE-Bench
View on GitHub
Official repo for "SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization"
☆28Mar 24, 2026Updated 4 months ago
ECNU-ICALK / SocraticMath
View on GitHub
[CIKM 2024] Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching
☆14Apr 2, 2026Updated 3 months ago
XL2248 / VHM
View on GitHub
Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"
☆18Sep 5, 2022Updated 3 years ago
XL2248 / AGDT
View on GitHub
Code for "A Novel Aspect-Guided Deep Transition Model for Aspect Based Sentiment Analysis." on EMNLP 2019.
☆21Dec 22, 2019Updated 6 years ago
SidU / MathBlackBox
View on GitHub
☆11Jul 21, 2024Updated 2 years ago
Unbabel / BConTrasT
View on GitHub
☆20Aug 17, 2021Updated 4 years ago
llmeval / LLMEval-Med
View on GitHub
[EMNLP 2025] A real-world clinical benchmark for medical LLMs with physician validation — 2,996 questions from EHRs
☆28May 21, 2026Updated 2 months ago
HuihuiChyan / BJTUNLP_Practice2019
View on GitHub
This is the official leaderboard of the six practice for the new commers of BJTUNLPers.
☆15Dec 17, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
csitfun / GLoRE
View on GitHub
a benckmark for evaluating logical reasoning of LLMs
☆23Jan 25, 2024Updated 2 years ago
StevenLau6 / FINDSum
View on GitHub
A Large-Scale Dataset for Long Text and Multi-Table Summarization
☆18Feb 21, 2024Updated 2 years ago
elog-x / yuque-hexo
View on GitHub
语雀 + Elog + Hexo + GitHub Actions + Vercel 博客解决方案
☆12Jul 9, 2024Updated 2 years ago
boom-R123 / ChatWK
View on GitHub
Usings LLM chat with knowledges
☆21Aug 12, 2023Updated 2 years ago
CLUEbenchmark / SuperCLUE-Math6
View on GitHub
SuperCLUE-Math6：新一代中文原生多轮多步数学推理数据集的探索之旅
☆60Feb 5, 2024Updated 2 years ago
binary-husky / void-terminal
View on GitHub
The CLI & python API for the well-known project gpt-academic.
☆19Sep 22, 2024Updated last year
JocelynSong / EmotionalDialogueSystem
View on GitHub
☆26Oct 21, 2019Updated 6 years ago
ClConstantine / CCNU-Beamer-Theme
View on GitHub
☆10Mar 18, 2024Updated 2 years ago
danielvarab / massive-summ
View on GitHub
☆31Apr 21, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
WuNein / vllm4mteb
View on GitHub
vLLM for embedding tasks using Original LLMs (Qwen2, LLaMA)
☆29Sep 9, 2024Updated last year
thunlp / SchemaReinforcementLearning
View on GitHub
Learning to Generate STRUCTURED Output with Schema Reinforcement Learning
☆26Mar 2, 2025Updated last year
SYSU-MUCFC-FinTech-Research-Center / ZhiLu
View on GitHub
智鹿：中文消金领域对话大模型
☆30Nov 12, 2023Updated 2 years ago
Jsewill / morton
View on GitHub
A Morton Order (Z-Order Curve) library, written in Go.
☆13Mar 24, 2026Updated 4 months ago
hkust-nlp / dart-math
View on GitHub
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
☆120Dec 10, 2024Updated last year
SaKongA / EgoTools
View on GitHub
基于Goodies开源组件制作的华为 Matebook E Go 第三方调节工具
☆28Apr 5, 2026Updated 3 months ago
krystalan / chatgpt_as_nlg_evaluator
View on GitHub
Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study
☆43Mar 8, 2023Updated 3 years ago