Alibaba-NLP / CDQALinks
CDQA: Chinese Dynamic Question Answering Benchmark
☆17Updated last year
Alternatives and similar repositories for CDQA
Users that are interested in CDQA are comparing it to the libraries listed below
Sorting:
- Papers and Resources for Information Extraction via Large Language Models☆32Updated 2 years ago
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆40Updated 3 weeks ago
- Source code and dataset for EMNLP 2022 paper "MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subev…☆91Updated 2 years ago
- ☆40Updated 2 years ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆50Updated last year
- This is the repo which record the evolution of LM-based dialogue system. More details can be found in our original survey paper: A Survey…☆63Updated 8 months ago
- ☆98Updated last year
- An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extrac…☆134Updated last year
- Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"☆22Updated 2 years ago
- EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!☆45Updated last year
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆161Updated 2 years ago
- The codes for ACL2022 paper “CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation☆23Updated 3 years ago
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud☆22Updated last year
- ☆98Updated 2 years ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆14Updated 2 years ago
- Pytorch implementation of baseline models of KQA Pro, a large-scale dataset of complex question answering over knowledge base.☆138Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆86Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆90Updated last year
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated last year
- 中文机器阅读理解数据集☆108Updated 4 years ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆103Updated 2 years ago
- Code for the IJCAI2020 submission: "BERT-PLI: Modeling Paragraph-Level Interactions for Legal Case Retrieval"☆37Updated last year
- [ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"☆40Updated 2 years ago
- The respository of jec-qa.☆60Updated 5 years ago
- ☆35Updated last month
- 一种面向中文复杂问句的查询图生成方法,以及一份含有多种复杂句的中文知识图谱问答数据集☆18Updated 2 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆136Updated last year
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models☆35Updated last year
- ☆147Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆89Updated last year