Alibaba-NLP / CDQA
CDQA: Chinese Dynamic Question Answering Benchmark
☆17 Updated 10 months ago
Alternatives and similar repositories for CDQA
Users who are interested in CDQA are comparing it to the libraries listed below.
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI2024) ☆49 Updated last year
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors ☆38 Updated 8 months ago
- ☆96 Updated last year
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud ☆21 Updated last year
- Source code and dataset for EMNLP 2022 paper "MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subev … ☆86 Updated 2 years ago
- An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extrac… ☆133 Updated last year
- Papers and Resources for Information Extraction via Large Language Models ☆32 Updated 2 years ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking. ☆161 Updated 2 years ago
- EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples! ☆44 Updated last year
- ☆35 Updated last year
- A large-scale complex question answering evaluation of ChatGPT and similar large-language models ☆40 Updated last year
- The codes for ACL2022 paper “CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation” ☆23 Updated 2 years ago
- Pytorch implementation of baseline models of KQA Pro, a large-scale dataset of complex question answering over knowledge base. ☆133 Updated last year
- ☆40 Updated 2 years ago
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University. ☆65 Updated 2 years ago
- [ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors" ☆40 Updated last year
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness ☆144 Updated last year
- ☆43 Updated last year
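- A query graph generation method for complex Chinese questions, together with a Chinese knowledge graph question answering dataset containing many types of complex questions ☆18 Updated 2 years ago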
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations" ☆14 Updated 2 years ago
- [COLING 2025] Official Repo for Paper "Beyond Boundaries: Learning Universal Entity Taxonomy across Datasets and Languages for Open Named… ☆25 Updated last month
- ☆145 Updated last year
- ☆17 Updated 2 years ago
- ☆98 Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models" ☆135 Updated last year
- Measuring Massive Multitask Chinese Understanding ☆89 Updated last year
- Source code and dataset for ACL2022 Findings Paper "LEVEN: A Large-Scale Chinese Legal Event Detection dataset" ☆115 Updated 2 years ago
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models ☆35 Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark ☆84 Updated last year
- ☆25 Updated 2 years ago