google-research-datasets / natural-questions
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
☆937Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for natural-questions
- Library for Knowledge Intensive Language Tasks☆915Updated 2 years ago
- Officially supported AllenNLP models☆527Updated last year
- Shared repository for open-sourced projects from the Google AI Language team.☆1,624Updated 2 weeks ago
- Autoregressive Entity Retrieval☆763Updated last year
- Scripts and links to recreate the ELI5 dataset.☆318Updated 3 years ago
- Code for using and evaluating SpanBERT.☆891Updated last year
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆631Updated last year
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆555Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆605Updated 2 years ago
- jiant is an nlp toolkit☆1,644Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆692Updated last year
- A python tool for evaluating the quality of sentence embeddings.☆2,087Updated 7 months ago
- ACL2020 Tutorial: Open-Domain Question Answering☆836Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆388Updated 2 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆428Updated 2 years ago
- Evaluating Cross-lingual Sentence Representations☆441Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆291Updated 4 years ago
- Tools to download and cleanup Common Crawl data☆971Updated last year
- ☆449Updated 3 years ago
- A full Python Implementation of the ROUGE Metric (not a wrapper)☆669Updated last year
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,723Updated last year
- BERT for Coreference Resolution☆445Updated last year
- [DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations☆773Updated 3 years ago
- A Visual Analysis Tool to Explore Learned Representations in Transformers Models☆585Updated 9 months ago
- The Schema-Guided Dialogue Dataset☆548Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,131Updated 8 months ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆370Updated 4 months ago
- ☆476Updated 2 years ago
- Entity Linker solution☆1,170Updated last year
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆436Updated last month