allenai / OpenBookQA
Code for experiments on OpenBookQA from the EMNLP 2018 paper "Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering"
☆128Updated 4 years ago
Alternatives and similar repositories for OpenBookQA
Users who are interested in OpenBookQA are comparing it to the libraries listed below
- ARC Question Solvers☆82Updated 4 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆105Updated 4 years ago
- This repository maintains the QAConv dataset, a question-answering dataset on informative conversations including business emails, panel …☆83Updated 10 months ago
- ☆30Updated 3 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 4 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆155Updated 2 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆92Updated 3 years ago
- ☆97Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Updated last year
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- Language model Prompt And Query Archive☆158Updated 4 years ago
- Code for ModularQA☆28Updated 4 years ago
- ☆78Updated last year
- TAP (Translucent Answer Prediction) is a system to identify answers and evidence (in the form of supporting facts) in an RCQA task that …☆28Updated 5 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆98Updated 2 years ago
- The accompanying code for "Injecting Numerical Reasoning Skills into Language Models" (Mor Geva*, Ankit Gupta* and Jonathan Berant, ACL 2…☆89Updated last year
- FEVER (Fact Extraction and VERification) Annotation Platform and Baselines☆113Updated last year
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆119Updated 3 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Updated 5 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".☆71Updated last year
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆31Updated 2 years ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆83Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 7 months ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections☆50Updated 3 years ago
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP☆108Updated 3 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 3 years ago
- Resources for the IBM Airlines Table-Question-Answering Benchmark☆31Updated 3 years ago
- ☆59Updated last year
- ☆49Updated 2 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 5 years ago