isen-zhang / ACLUE
Official GitHub repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension
☆27Updated 11 months ago
Alternatives and similar repositories for ACLUE:
Users who are interested in ACLUE are comparing it to the libraries listed below
- ☆95Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆75Updated 4 months ago
- ☆133Updated 10 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆120Updated 9 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated 11 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆100Updated last year
- ☆97Updated 11 months ago
- Chinese Large Language Model Evaluation, Phase 2☆70Updated last year
- ☆160Updated last year
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud☆22Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- ☆80Updated last year
- ☆141Updated 8 months ago
- LAiW: A Chinese Legal Large Language Models Benchmark☆78Updated 8 months ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆43Updated 9 months ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆112Updated 3 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆70Updated 3 weeks ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆16Updated 4 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆137Updated 8 months ago
- ☆73Updated 6 months ago
- Simple and efficient multi-GPU fine-tuning of large models with deepspeed + trainer☆123Updated last year
- ☆44Updated 9 months ago
- Datasets from past editions of the Chinese syntactic error diagnosis technical evaluation☆38Updated 2 years ago
- ☆64Updated last year
- ☆21Updated 5 months ago
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆64Updated last year
- How to train an LLM tokenizer☆142Updated last year
- ☆22Updated last year
- "WuDao" data☆41Updated 3 years ago
- ☆21Updated 3 months ago