google-research-datasets / natural-questionsLinks
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
☆1,027Updated 4 years ago
Alternatives and similar repositories for natural-questions
Users that are interested in natural-questions are comparing it to the libraries listed below
Sorting:
- Library for Knowledge Intensive Language Tasks☆953Updated 3 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆748Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆645Updated 2 years ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,695Updated last month
- Officially supported AllenNLP models☆547Updated 2 years ago
- jiant is an nlp toolkit☆1,670Updated 2 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆441Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆397Updated 3 years ago
- ☆524Updated 4 years ago
- [DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations☆808Updated 4 years ago
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,825Updated 2 years ago
- Autoregressive Entity Retrieval☆793Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆605Updated 3 years ago
- ACL2020 Tutorial: Open-Domain Question Answering☆833Updated 4 years ago
- A full Python Implementation of the ROUGE Metric (not a wrapper)☆700Updated 8 months ago
- Scripts and links to recreate the ELI5 dataset.☆326Updated 3 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆560Updated 3 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆400Updated last year
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆310Updated 5 years ago
- Code for using and evaluating SpanBERT.☆899Updated 2 years ago
- The Schema-Guided Dialogue Dataset☆580Updated 2 years ago
- Visually Explore the Stanford Question Answering Dataset☆568Updated last year
- Anserini is a Lucene toolkit for reproducible information retrieval research☆1,065Updated this week
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆328Updated 2 years ago
- Topic-Aware Convolutional Neural Networks for Extreme Summarization☆367Updated 2 years ago
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆453Updated 10 months ago
- Tools to download and cleanup Common Crawl data☆1,021Updated 2 years ago
- A research project for natural language generation, containing the official implementations by MSRA NLC team.☆733Updated last year
- Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)☆908Updated 6 months ago
- A summary of must-read papers for Neural Question Generation (NQG)☆586Updated 3 years ago