isen-zhang / ACLUE
Official GitHub repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension
☆27 Updated 11 months ago
Alternatives and similar repositories for ACLUE:
Users who are interested in ACLUE are comparing it to the repositories listed below:
- ☆95 Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ☆75 Updated 3 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark ☆100 Updated last year
- ☆97 Updated 11 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆120 Updated 9 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING ☆88 Updated 11 months ago
- ☆141 Updated 8 months ago
- ☆133 Updated 10 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus ☆188 Updated 2 months ago
- Simple, efficient multi-GPU fine-tuning of large models with deepspeed + trainer ☆123 Updated last year
- Chinese large language model evaluation, phase 2 ☆70 Updated last year
- ☆80 Updated last year
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud ☆22 Updated last year
- ☆160 Updated last year
- ☆22 Updated last year
- This repo records the evolution of LM-based dialogue systems. More details can be found in our original survey paper: A Survey… ☆57 Updated last week
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve … ☆33 Updated 2 months ago
- Datasets from past editions of the Chinese syntactic error diagnosis evaluation ☆38 Updated 2 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings) ☆84 Updated 3 weeks ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors ☆36 Updated last month
- Fine-tuning large language models with the DPO algorithm; simple and easy to get started. ☆31 Updated 8 months ago
- LAiW: A Chinese Legal Large Language Models Benchmark ☆78 Updated 8 months ago
- A curated collection of Chinese and English information extraction datasets ☆17 Updated 2 years ago
- ☆128 Updated last year
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models ☆13 Updated 3 months ago
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining ☆56 Updated last year
- ☆69 Updated 5 months ago
- [EMNLP2024] Aligning Large Language Models on Information Extraction ☆44 Updated 4 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models ☆40 Updated last year
- How to train an LLM tokenizer ☆142 Updated last year