dair-iitd / ECQA-Dataset
Dataset Release for Explanations for CommonsenseQA, ACL 2021 Paper
☆20Updated 4 years ago
Alternatives and similar repositories for ECQA-Dataset
Users who are interested in ECQA-Dataset are comparing it to the repositories listed below
- ☆83Updated 2 years ago
- ☆50Updated 2 years ago
- Supporting code for ReCEval paper☆31Updated last year
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"☆22Updated 5 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆23Updated 4 years ago
- ☆15Updated 5 years ago
- ☆36Updated last year
- ☆58Updated 3 years ago
- Data and Code Release for "On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries"☆55Updated 5 years ago
- [EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning☆15Updated 3 years ago
- A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene…☆16Updated 3 years ago
- ☆74Updated last year
- The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023)☆13Updated 2 years ago
- A paper list of research conducted based on wikiHow☆27Updated 3 years ago
- ☆21Updated 4 years ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆85Updated 2 years ago
- WinoWhy provides human-annotated reasons for answering WSC questions.☆18Updated 5 years ago
- Code for the paper "Critical Thinking for Language Models"☆12Updated 4 years ago
- [EMNLP 2020] PyTorch code of PRover: Proof Generation for Interpretable Reasoning over Rules☆19Updated 2 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆43Updated 2 years ago
- Tools and datasets for Aristo Leaderboards☆42Updated 4 years ago
- ☆47Updated 2 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆48Updated 2 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 4 years ago
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆26Updated 4 years ago
- ☆49Updated 2 years ago
- EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443☆86Updated last year
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆23Updated 3 years ago
- Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 20…☆109Updated 3 years ago
- TBC☆28Updated 3 years ago