isen-zhang / ACLUE
Official GitHub repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension
☆24 Updated 10 months ago
Alternatives and similar repositories for ACLUE:
Users who are interested in ACLUE are comparing it to the repositories listed below
- ☆93 Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation ☆74 Updated 3 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark ☆100 Updated last year
- ☆79 Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆116 Updated 8 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING ☆88 Updated 10 months ago
- ☆139 Updated 7 months ago
- LAiW: A Chinese Legal Large Language Models Benchmark ☆77 Updated 7 months ago
- ☆129 Updated 10 months ago
- ☆96 Updated 10 months ago
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud ☆22 Updated 11 months ago
- ☆22 Updated last year
- ☆159 Updated last year
- ☆62 Updated last year
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction> ☆15 Updated last year
- This is the repo that records the evolution of LM-based dialogue systems. More details can be found in our original survey paper: A Survey… ☆55 Updated 3 months ago
- Chinese Large Language Model Evaluation, Phase 2 ☆70 Updated last year
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models ☆12 Updated 2 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆61 Updated this week
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆136 Updated 7 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆79 Updated last year
- Fine-tuning large language models with the DPO algorithm; simple and easy to get started. ☆30 Updated 7 months ago
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University. ☆64 Updated last year
- make LLM easier to use ☆59 Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆103 Updated 4 months ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors ☆35 Updated last week
- A collection for math word problem (MWP) works, including datasets, algorithms and so on. ☆40 Updated 7 months ago
- A framework for editing the CoTs for better factuality ☆48 Updated last year
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models ☆14 Updated 3 months ago
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration ☆36 Updated 11 months ago