google-research-datasets / natural-questions
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
☆962 Updated 3 years ago
Alternatives and similar repositories for natural-questions:
Users who are interested in natural-questions compare it to the libraries listed below.
- Library for Knowledge Intensive Language Tasks ☆920 Updated 2 years ago
- Code for using and evaluating SpanBERT. ☆895 Updated last year
- Officially supported AllenNLP models ☆534 Updated 2 years ago
- Shared repository for open-sourced projects from the Google AI Language team. ☆1,640 Updated 2 months ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an… ☆555 Updated 3 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty… ☆636 Updated 2 years ago
- A python tool for evaluating the quality of sentence embeddings. ☆2,091 Updated 9 months ago
- Visually Explore the Stanford Question Answering Dataset ☆559 Updated last year
- jiant is an NLP toolkit ☆1,657 Updated last year
- ☆468 Updated 3 years ago
- Autoregressive Entity Retrieval ☆775 Updated last year
- Scripts and links to recreate the ELI5 dataset. ☆320 Updated 3 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… ☆604 Updated 2 years ago
- The Schema-Guided Dialogue Dataset ☆556 Updated last year
- ACL2020 Tutorial: Open-Domain Question Answering ☆834 Updated 4 years ago
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models. ☆1,135 Updated 10 months ago
- BERT for Coreference Resolution ☆445 Updated 2 years ago
- Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP) ☆875 Updated this week
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆711 Updated last year
- [DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations ☆781 Updated 3 years ago
- A full Python Implementation of the ROUGE Metric (not a wrapper) ☆682 Updated last month
- Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks. ☆1,741 Updated last year
- KnowBert -- Knowledge Enhanced Contextual Word Representations ☆373 Updated 4 years ago
- Entity Linker solution ☆1,174 Updated last year
- This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and … ☆467 Updated 4 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆299 Updated 4 years ago
- ☆480 Updated 2 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System ☆430 Updated 2 years ago
- High-accuracy NLP parser with models for 11 languages. ☆878 Updated 3 years ago
- A summary of must-read papers for Neural Question Generation (NQG) ☆584 Updated 3 years ago