allenai / OpenBookQA
Code for experiments on OpenBookQA from the EMNLP 2018 paper "Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering"
☆126 Updated 3 years ago
Alternatives and similar repositories for OpenBookQA:
Users who are interested in OpenBookQA are comparing it to the repositories listed below.
- ☆97 Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆59 Updated 9 months ago
- TAP (Translucent Answer Prediction) is a system to identify answers and evidence (in the form of supporting facts) in an RCQA task that … ☆28 Updated 4 years ago
- Few-shot NLP benchmark for unified, rigorous eval ☆91 Updated 2 years ago
- ARC Question Solvers ☆83 Updated 3 years ago
- ☆36 Updated last year
- Token-level Reference-free Hallucination Detection ☆93 Updated last year
- ☆77 Updated 8 months ago
- This repository contains the code for "How many data points is a prompt worth?" ☆48 Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 Updated 5 months ago
- This repository maintains the QAConv dataset, a question-answering dataset on informative conversations including business emails, panel … ☆81 Updated 3 months ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines". ☆82 Updated last year
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections ☆50 Updated 3 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ☆101 Updated 3 years ago
- Author implementation of the paper "CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge" ☆155 Updated 5 months ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation" ☆148 Updated 2 years ago
- ☆48 Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆139 Updated 2 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆120 Updated 4 years ago
- Dense hybrid representations for text retrieval ☆62 Updated last year
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 Updated 2 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering". ☆71 Updated 5 months ago
- Data and Code Release for "On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries" ☆52 Updated 4 years ago
- ☆37 Updated 2 years ago
- A reference-free metric for measuring summary quality, learned from human ratings. ☆42 Updated 2 years ago
- Question Answering and Generation for Summarization ☆68 Updated 2 years ago
- ☆67 Updated 3 years ago
- Code for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation" ☆63 Updated 3 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021 ☆29 Updated last year
- ☆44 Updated last year