isen-zhang / ACLUE
Official GitHub repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension
☆33Mar 20, 2024Updated last year
Alternatives and similar repositories for ACLUE
Users who are interested in ACLUE are comparing it to the libraries listed below
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- An evaluation benchmark for classical Chinese☆18Dec 13, 2023Updated 2 years ago
- Classical Chinese information extraction (entity recognition + relation extraction)☆10Feb 24, 2023Updated 2 years ago
- A Benchmark for Classical Chinese Based on a Crowdsourcing System.☆59May 25, 2021Updated 4 years ago
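- CCL 2023: Construction and application of a corpus of interchangeable characters (tongjiazi) in ancient Chinese: the tongjiazi resource library☆20Sep 23, 2023Updated 2 years ago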
- [EMNLP 2024] TongGu, a classical Chinese language model.☆58Sep 28, 2024Updated last year
- ☆411Jul 20, 2025Updated 6 months ago
- An open-source image dataset of Chinese calligraphy characters in the regular script (kaishu) style of the calligrapher Yan Zhenqing, 856 images in total.☆24Mar 2, 2021Updated 4 years ago
- Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆55Aug 23, 2023Updated 2 years ago
- MultiSpanQA: A Dataset for Multi-Span Question Answering☆28Jan 24, 2026Updated 3 weeks ago
- CMMLU: Measuring massive multitask language understanding in Chinese☆802Dec 6, 2024Updated last year
- GAOKAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.☆38Jan 7, 2025Updated last year
- Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)☆35Updated this week
- ☆12Jan 11, 2026Updated last month
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
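- Welcome to RAG (retrieval-augmented generation)! This is a question-answering system that uses the OpenAI API and the Milvus vector database, combined with retrieval-augmented generation (RAG).☆10Nov 4, 2024Updated last year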
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated 11 months ago
- Code for doing Argument Structure Prediction using Residual Networks and (almost) without symbolic features☆11May 24, 2023Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- A relation extraction model based on a BERT + Biaffine architecture☆12Feb 23, 2022Updated 3 years ago
- Workflow based on github issues.☆11Apr 30, 2019Updated 6 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated 10 months ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
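- An evaluation benchmark for Chinese financial large language models: six categories, twenty-five tasks, graded evaluation; domestic (Chinese) models achieve grade A☆10May 6, 2024Updated last year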
- codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"☆12Feb 10, 2025Updated last year
- Public Chinese chat corpus☆11Nov 5, 2018Updated 7 years ago
- The official Python library for Openlayer, the Continuous Model Improvement Platform for AI. 📈☆16Feb 10, 2026Updated last week
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- ☆12Nov 5, 2024Updated last year
- Neural ngram language model in PyTorch.☆10Sep 27, 2018Updated 7 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- ☆11Nov 5, 2024Updated last year
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- [EMNLP 2022] Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive Learning☆14Jun 22, 2024Updated last year
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year
- Ant Financial natural language processing competition.☆10Sep 3, 2018Updated 7 years ago