yasumasaonoe / creakLinks
☆20Updated 2 years ago
Alternatives and similar repositories for creak
Users that are interested in creak are comparing it to the libraries listed below
Sorting:
- ☆18Updated 4 years ago
- ☆58Updated 3 years ago
- Code for Editing Factual Knowledge in Language Models☆141Updated 3 years ago
- ☆15Updated 2 years ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆73Updated 3 years ago
- ☆51Updated last year
- ☆36Updated last year
- ☆75Updated last year
- FRANK: Factuality Evaluation Benchmark☆59Updated 2 years ago
- ☆50Updated 2 years ago
- Codes for ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆31Updated 2 years ago
- Constrained Decoding Project☆17Updated last year
- ☆32Updated 2 years ago
- Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)☆112Updated 3 years ago
- ☆82Updated 2 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Updated last year
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆43Updated 2 years ago
- ☆177Updated last year
- ☆100Updated 3 years ago
- ☆72Updated last year
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Updated 2 years ago
- code associated with ACL 2021 DExperts paper☆117Updated 2 years ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆16Updated 2 years ago
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆49Updated 2 years ago
- ☆88Updated 3 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆22Updated last year
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- ☆88Updated 2 years ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆131Updated 3 years ago
- Codes for the EMNLP2021 paper: Benchmarking Commonsense Knowledge Base Population (https://aclanthology.org/2021.emnlp-main.705.pdf). An …☆26Updated last year