AI-EDU-LAB/E-EVAL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AI-EDU-LAB/E-EVAL)

AI-EDU-LAB / E-EVAL

Official github repo for E-Eval, a Chinese K12 education evaluation benchmark for LLMs.

☆29

Alternatives and similar repositories for E-EVAL

Users that are interested in E-EVAL are comparing it to the libraries listed below

Sorting:

SmileWHC / CJEval
View on GitHub
CEduMEval : A Chinese educational multi-task evaluation benchmark
☆16Nov 18, 2024Updated last year
Alpha-VLLM / WeMix-LLM
View on GitHub
☆17Oct 15, 2023Updated 2 years ago
Zc0812 / Edu_Planner
View on GitHub
☆25Apr 8, 2025Updated 10 months ago
p-ortmann / dyntapy
View on GitHub
☆10Mar 8, 2024Updated last year
tristandeleu / jax-jsp-gfn
View on GitHub
Official code for the paper "Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network"
☆16Aug 9, 2023Updated 2 years ago
CLUEbenchmark / Math24o
View on GitHub
Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark
☆11Mar 27, 2025Updated 11 months ago
SuperGPQA / SuperGPQA
View on GitHub
☆185Apr 30, 2025Updated 10 months ago
jiajingyyyyyy / AutoTool
View on GitHub
[AAAI 2026] AutoTool: Efficient Tool Selection for Large Language Model Agents
☆29Dec 28, 2025Updated 2 months ago
sa-and / MCD
View on GitHub
☆12Mar 21, 2024Updated last year
tanmayy24 / Baby_Cry_Detection_Database
View on GitHub
☆13Jan 31, 2023Updated 3 years ago
tsinghua-fib-lab / UGI
View on GitHub
Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City
☆12Dec 17, 2023Updated 2 years ago
JmlrOrg / dmlr-style-file
View on GitHub
☆12Nov 21, 2023Updated 2 years ago
SynthRAD2023 / metrics
View on GitHub
Example of evaluation metrics used in the SynthRAD2023 challenge
☆11Jul 14, 2023Updated 2 years ago
iPieter / ethical-adversaries
View on GitHub
⚖️ Code for the paper "Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning".
☆11Dec 8, 2022Updated 3 years ago
alextamkin / active-learning-pretrained-models
View on GitHub
Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…
☆11Nov 22, 2022Updated 3 years ago
metterian / korean_bert_score
View on GitHub
BERT score for text generation
☆12Jan 15, 2025Updated last year
aldo-g / obsidian-llm-test
View on GitHub
☆12Jan 14, 2026Updated last month
ShenzheZhu / A2A-NT
View on GitHub
Official code of "The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets"
☆23Sep 20, 2025Updated 5 months ago
ybai-nlp / EduBench
View on GitHub
Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…
☆19May 28, 2025Updated 9 months ago
rubenohana / Reservoir-computing-kernels
View on GitHub
☆11Nov 9, 2020Updated 5 years ago
dxzxy12138 / PhysReason
View on GitHub
PhysReason Becnhmark
☆19Jul 8, 2025Updated 7 months ago
eth-lre / PedagogicalRL
View on GitHub
Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral
☆31Dec 11, 2025Updated 2 months ago
aigerimb / FFEVSS
View on GitHub
☆10Dec 29, 2020Updated 5 years ago
Applied-Machine-Learning-Lab / GARLIC
View on GitHub
☆11Dec 11, 2024Updated last year
pkunlp-icler / SCL-RAI
View on GitHub
Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022
☆11Aug 20, 2022Updated 3 years ago
RuttenStijn / Thesis
View on GitHub
Code and extra figures as part of the thesis about Relative transfer function estimation for multi-microphone speech enhancement based on…
☆11Jan 10, 2018Updated 8 years ago
myracheng / lm_caricature
View on GitHub
code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
☆11Oct 13, 2023Updated 2 years ago
vvhg1 / guided-text-generation-with-classifier-free-language-diffusion
View on GitHub
Exploring classifier-free guidance in a DDPM language model for text generation towards emotion targets.
☆11Sep 7, 2025Updated 5 months ago
asonabend / ESRL
View on GitHub
Code for Expert Supervised Reinforcement Learning
☆10Apr 7, 2021Updated 4 years ago
skyrahul / DTA2020
View on GitHub
Dynamic Traffic Assignment
☆16Aug 25, 2020Updated 5 years ago
ycpNotFound / GeoGen
View on GitHub
A pipeline for the automatic construction of geometry problems along with step-by-step solutions.
☆17Aug 27, 2025Updated 6 months ago
CLUEbenchmark / SuperCLUE-Code3
View on GitHub
中文原生等级化代码能力测试基准
☆15Apr 11, 2024Updated last year
gkasieczka / DisCo
View on GitHub
☆14Nov 29, 2020Updated 5 years ago
sharan-dce / autograd
View on GitHub
Auto-differentiation library for C++
☆12Jan 16, 2022Updated 4 years ago
fzyzcjy / ai_math_paper_list
View on GitHub
AI for Mathematics Paper List
☆17Jan 14, 2025Updated last year
JusperLee / ExamOnline
View on GitHub
This is a complete online exam system
☆10Dec 27, 2019Updated 6 years ago
ifLab / eduhub
View on GitHub
LLM Application Systems for Education
☆11May 16, 2025Updated 9 months ago
Mokerpoker / k_shortest_paths
View on GitHub
Networkx implementation of Yen's k shortest paths algorithm.
☆11Nov 6, 2018Updated 7 years ago
Caoyumin97 / Bandwidth-based-Signal-Coordination
View on GitHub
Reproducing several bandwidth-based traffic signal coordination models (including MaxBand, MultiBand, etc.)
☆11Sep 18, 2020Updated 5 years ago