Alibaba-NLP / CDQA
CDQA: Chinese Dynamic Question Answering Benchmark
☆17Updated 10 months ago
Alternatives and similar repositories for CDQA
Users who are interested in CDQA are comparing it to the libraries listed below
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024)☆50Updated last year
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆14Updated 2 years ago
- ☆97Updated last year
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆38Updated 9 months ago
- Papers and Resources for Information Extraction via Large Language Models☆32Updated 2 years ago
- An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extrac…☆134Updated last year
- The code for the ACL 2022 paper “CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation”☆23Updated 3 years ago
- Source code and dataset for EMNLP 2022 paper "MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subev…☆86Updated 2 years ago
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud☆21Updated last year
- The source code of paper "CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking"☆79Updated 2 years ago
- This is the repo that records the evolution of LM-based dialogue systems. More details can be found in our original survey paper: A Survey…☆63Updated 6 months ago
- The repository of jec-qa.☆57Updated 5 years ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆162Updated 2 years ago
- ☆146Updated last year
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated last year
- Chinese AMR Corpus☆38Updated 6 months ago
- ☆15Updated 4 years ago
- ☆19Updated 4 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆135Updated last year
- ☆35Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆84Updated last year
- Pytorch implementation of baseline models of KQA Pro, a large-scale dataset of complex question answering over knowledge base.☆134Updated last year
- ☆40Updated 2 years ago
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models☆35Updated last year
- [ACL'23 Findings] "Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors"☆40Updated last year
- ☆98Updated last year
- A large-scale complex question answering evaluation of ChatGPT and similar large-language models☆40Updated last year
- A Large-Scale Chinese Legal Case Retrieval Dataset☆78Updated 10 months ago
- The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval☆93Updated 2 years ago
- EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!☆45Updated last year