LaVi-Lab / CLEVA
[EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform" [ACL 2025 Findings] "C2LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation"
☆63 Updated last month
Alternatives and similar repositories for CLEVA
Users who are interested in CLEVA are comparing it to the libraries listed below.
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models" ☆101 Updated 2 years ago
- ☆128 Updated 2 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆130 Updated last year
- ☆96 Updated last year
- Chinese Large Language Model Evaluation, Phase 1