cloudygoose / blindspot_nlg
☆19Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for blindspot_nlg
- ☆25Updated 7 months ago
- Codes for ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆27Updated last year
- AbstainQA, ACL 2024☆19Updated 3 weeks ago
- ☆26Updated 6 months ago
- ☆47Updated 6 months ago
- Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆47Updated last year
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"☆17Updated last year
- ☆22Updated 8 months ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆46Updated 11 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆54Updated 10 months ago
- ☆13Updated 2 years ago
- ☆16Updated last year
- Code for the ACL2023 paper: CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning (https://aclant…☆10Updated last year
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆18Updated last year
- ☆82Updated last year
- ☆37Updated last year
- Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆21Updated 2 weeks ago
- About Data and Codes for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"☆15Updated 10 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆101Updated last month
- ☆15Updated 9 months ago
- ☆36Updated 10 months ago
- The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte…☆14Updated 9 months ago
- What does the bot say? ACL 2024☆13Updated 2 months ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆10Updated 8 months ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆62Updated 2 years ago
- ☆24Updated last year
- ☆17Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆60Updated 3 months ago
- ☆66Updated 9 months ago
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆22Updated last year