google-research-datasets / natural-questions
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
☆969Updated 3 years ago
Alternatives and similar repositories for natural-questions:
Users that are interested in natural-questions are comparing it to the libraries listed below
- Library for Knowledge Intensive Language Tasks☆928Updated 2 years ago
- Autoregressive Entity Retrieval☆781Updated last year
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆555Updated 3 years ago
- Officially supported AllenNLP models☆538Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆638Updated 2 years ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,650Updated last week
- Entity Linker solution☆1,177Updated last year
- jiant is an nlp toolkit☆1,661Updated last year
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆438Updated 5 months ago
- ACL2020 Tutorial: Open-Domain Question Answering☆833Updated 4 years ago
- Evaluating Cross-lingual Sentence Representations☆449Updated 3 years ago
- The Schema-Guided Dialogue Dataset☆557Updated last year
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆300Updated 4 years ago
- Adversarial Natural Language Inference Benchmark☆396Updated 2 years ago
- ☆472Updated 3 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆603Updated 2 years ago
- A full Python Implementation of the ROUGE Metric (not a wrapper)☆685Updated 3 months ago
- Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"☆364Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,138Updated last year
- Visually Explore the Stanford Question Answering Dataset☆558Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆715Updated last year
- This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and …☆471Updated 4 years ago
- ☆480Updated 3 years ago
- Anserini is a Lucene toolkit for reproducible information retrieval research☆1,047Updated this week
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆431Updated 2 years ago
- Code for using and evaluating SpanBERT.☆895Updated last year
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,753Updated last year
- Scripts and links to recreate the ELI5 dataset.☆320Updated 3 years ago
- SQuAD Question Answering Using BERT, PyTorch☆397Updated 2 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 3 years ago