Official github repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension
☆33Mar 20, 2024Updated last year
Alternatives and similar repositories for ACLUE
Users that are interested in ACLUE are comparing it to the libraries listed below
Sorting:
- An evaluation bentchmark for classical Chinese☆18Dec 13, 2023Updated 2 years ago
- 文言文信息抽取(实体识别+关系抽取)☆10Feb 24, 2023Updated 3 years ago
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆21Sep 23, 2023Updated 2 years ago
- [EMNLP 2024] TongGu, a classical Chinese language model.☆63Sep 28, 2024Updated last year
- 颜真卿书法家楷书风格的书法汉字图像数据集,图像数共计856张,已开源。☆24Mar 2, 2021Updated 5 years ago
- 古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆56Aug 23, 2023Updated 2 years ago
- MultiSpanQA: A Dataset for Multi-Span Question Answering☆28Jan 24, 2026Updated last month
- GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.☆39Jan 7, 2025Updated last year
- Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)☆35Mar 2, 2026Updated last week
- 欢迎来到 RAG 检索增强生成!这是一个使用 OpenAI API 和 Milvus 向量数据库的问答系统,结合了检索增强生成(RAG)技术。☆10Nov 4, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- ☆12Jan 11, 2026Updated last month
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- 生成训练文本检测数据集☆12Jul 1, 2020Updated 5 years ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- ☆41Feb 20, 2023Updated 3 years ago
- ☆10Jul 5, 2023Updated 2 years ago
- ☆11Oct 15, 2022Updated 3 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated 2 years ago
- Code and Data for GlitchBench☆13Feb 27, 2024Updated 2 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- The official Python library for Openlayer, the Continuous Model Improvement Platform for AI. 📈☆16Feb 25, 2026Updated last week
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- Finetuning LLaMA with DeepSpeed☆10Apr 14, 2023Updated 2 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆12Jan 22, 2025Updated last year
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
- ☆12Mar 5, 2025Updated last year
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- [EMNLP 2022] Fine-grained Category Discovery under Coarse-grained supervision with Hierarchical Weighted Self-contrastive Learning☆14Jun 22, 2024Updated last year
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- ☆11Nov 5, 2024Updated last year
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 6 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- 基于BERT+Biaffine结构的关系抽取模型☆12Feb 23, 2022Updated 4 years ago
- ☆12Nov 5, 2024Updated last year
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year