Official github repo for E-Eval, a Chinese K12 education evaluation benchmark for LLMs.
☆29Feb 19, 2024Updated 2 years ago
Alternatives and similar repositories for E-EVAL
Users that are interested in E-EVAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆50Mar 2, 2026Updated last month
- 中文原生等级化代码能力测试基准☆15Apr 11, 2024Updated 2 years ago
- ☆31Nov 9, 2024Updated last year
- [Recsys'2023] "RCL: Multi-Relational Contrastive Learning for Recommendation"☆16Sep 6, 2023Updated 2 years ago
- ☆40Mar 21, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆186Apr 30, 2025Updated 11 months ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- ☆21Feb 23, 2023Updated 3 years ago
- ☆60Apr 2, 2026Updated 2 weeks ago
- ☆10Nov 28, 2023Updated 2 years ago
- I fine-tuned (p-tuning) Tsinghua’s open-source large language model, ChatGLM2-6B, using several years of my WeChat chat history. Inspired…☆12Mar 6, 2024Updated 2 years ago
- [ACL 2025] Can MLLMs Understand the Deep Implication Behind Chinese Images?☆21Apr 9, 2026Updated last week
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆17Aug 27, 2025Updated 7 months ago
- ☆15Jul 22, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆77Jan 24, 2025Updated last year
- 基于bert4keras的SuperGLUE基准代码☆14Jun 25, 2022Updated 3 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- Papers on databases and algorithms of image/video quality assessment☆13Jul 9, 2019Updated 6 years ago
- ☆12Nov 21, 2023Updated 2 years ago
- Time-RA: Towards Time Series Reasoning for Anomaly with LLM Feedback☆21Jan 10, 2026Updated 3 months ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- Transformers for Multi-Label Text Classification☆11Sep 18, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Regularized Adversarial Training☆19Jun 28, 2023Updated 2 years ago
- Source code of ICML'22 paper: FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting☆10Jun 10, 2022Updated 3 years ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆40Dec 31, 2024Updated last year
- RecDCL: Dual Contrastive Learning for Recommendation (WWW'24, Oral)☆30Jul 6, 2024Updated last year
- ☆11Nov 9, 2020Updated 5 years ago
- Keras tutorial code for the SC18 tutorial on Deep Learning at Scale☆12Nov 12, 2018Updated 7 years ago
- The collection of related papers and resources for the paper Time Series Analysis for Education: Methods, Applications, and Future Direct…☆18Apr 12, 2025Updated last year
- Source code of paper "Systematic Assessment of Factual Knowledge in Large Language Models" - EMNLP Findings 2023☆17Mar 17, 2026Updated 3 weeks ago
- Python implementation of network deconvolution as a general method to distinguish direct dependencies in network☆17May 7, 2014Updated 11 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pytorch implementation for ICLR24:"Online GNN Evaluation Under Test-Time Graph Distribution Shifts"☆16Mar 23, 2024Updated 2 years ago
- ☆14Nov 29, 2020Updated 5 years ago
- ⚖️ Code for the paper "Ethical Adversaries: Towards Mitigating Unfairness with Adversarial Machine Learning".☆11Dec 8, 2022Updated 3 years ago
- ☆18Oct 12, 2022Updated 3 years ago
- TensorFlow model for training AdapNet for semantic segmentation☆14Jun 30, 2019Updated 6 years ago
- Code for DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks NAACL2022(https://aclanthology.org/202…☆23Jul 18, 2022Updated 3 years ago
- PhysReason Becnhmark☆19Jul 8, 2025Updated 9 months ago