StonyBrookNLP / tellmewhyLinks
Website for release of TellMeWhy dataset for why question answering
☆14Updated 2 years ago
Alternatives and similar repositories for tellmewhy
Users that are interested in tellmewhy are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 years ago
- Perturbation CheckLists for Evaluating NLG Evaluation Metrics, EMNLP 2021☆9Updated 3 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆22Updated 3 years ago
- ☆44Updated last year
- TBC☆27Updated 2 years ago
- Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- ☆58Updated 3 years ago
- Code and data for paper "On the Robustness of Reading Comprehension Models to Entity Renaming" (NAACL'22)☆11Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Code for ModularQA☆28Updated 3 years ago
- ☆48Updated 2 years ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Updated 3 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆24Updated 2 months ago
- The corresponding code from our paper " COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion (ACL …☆18Updated 2 years ago
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆26Updated 3 years ago
- [EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning☆11Updated 2 years ago
- ☆31Updated last year
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Updated 2 years ago
- ☆24Updated last year
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 4 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- ☆21Updated 3 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆78Updated last year
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆19Updated 4 years ago
- ☆13Updated last year
- ☆48Updated last year
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding " by Dongyeop Kang and Eduard Hovy☆14Updated 3 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆37Updated 2 years ago