google-research-datasets / natural-questions
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
☆981Updated 3 years ago
Alternatives and similar repositories for natural-questions:
Users that are interested in natural-questions are comparing it to the libraries listed below
- Library for Knowledge Intensive Language Tasks☆935Updated 2 years ago
- jiant is an nlp toolkit☆1,663Updated last year
- Autoregressive Entity Retrieval☆782Updated last year
- Officially supported AllenNLP models☆538Updated 2 years ago
- ACL2020 Tutorial: Open-Domain Question Answering☆834Updated 4 years ago
- ☆481Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆301Updated 4 years ago
- Visually Explore the Stanford Question Answering Dataset☆562Updated last year
- The Schema-Guided Dialogue Dataset☆562Updated last year
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆432Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆640Updated 2 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆557Updated 3 years ago
- Code for using and evaluating SpanBERT.☆896Updated last year
- Shared repository for open-sourced projects from the Google AI Language team.☆1,658Updated last week
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆441Updated 6 months ago
- Evaluating Cross-lingual Sentence Representations☆450Updated 3 years ago
- Entity Linker solution☆1,184Updated last year
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,766Updated last year
- A summary of must-read papers for Neural Question Generation (NQG)☆583Updated 3 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 3 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆603Updated 2 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆719Updated last year
- New dataset☆303Updated 3 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension and question answerin…☆216Updated last year
- Data and Code for ICLR2020 Paper "TabFact: A Large-scale Dataset for Table-based Fact Verification"☆392Updated last year
- Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)☆887Updated 2 months ago
- [DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations☆791Updated 3 years ago
- High-accuracy NLP parser with models for 11 languages.☆880Updated 3 years ago
- Anserini is a Lucene toolkit for reproducible information retrieval research☆1,046Updated this week
- An Open-Source Package for Information Retrieval.☆448Updated 2 years ago