thunlp / FalseQA
Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"
☆22Updated last year
Alternatives and similar repositories for FalseQA:
Users that are interested in FalseQA are comparing it to the libraries listed below
- ☆21Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆79Updated last year
- ☆60Updated 2 years ago
- Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022)☆57Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Updated last year
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆50Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆48Updated last year
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆46Updated 9 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated last year
- ☆44Updated 9 months ago
- ☆16Updated 11 months ago
- Do Large Language Models Know What They Don’t Know?☆90Updated 3 months ago
- ☆37Updated last year
- ☆85Updated last year
- ☆28Updated 11 months ago
- ☆122Updated 4 years ago
- ☆31Updated last year
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- Code for KE-Blender, EMNLP 2021☆19Updated 2 years ago
- Official Code for "PPT: Pre-trained Prompt Tuning for Few-shot Learning". ACL 2022☆109Updated 2 years ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆60Updated last year
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆35Updated last week
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆61Updated this week
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆17Updated last year
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆53Updated 10 months ago
- Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based L…☆17Updated last year
- ☆33Updated 2 years ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆15Updated last year
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆17Updated 2 months ago