google-research-datasets / tydiqaLinks
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the training and evaluation of automatic question answering systems. This repository provides evaluation code and a baseline system for the dataset.
☆307Updated 5 years ago
Alternatives and similar repositories for tydiqa
Users that are interested in tydiqa are comparing it to the libraries listed below
Sorting:
- New dataset☆304Updated 3 years ago
- ☆195Updated 3 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆203Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- Unsupervised Question answering via Cloze Translation☆219Updated 3 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆283Updated last year
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆438Updated 3 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)☆210Updated 4 years ago
- Scripts and links to recreate the ELI5 dataset.☆325Updated 3 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆604Updated 3 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆154Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆206Updated last year
- Adversarial Natural Language Inference Benchmark☆396Updated 3 years ago
- Multi-hop dense retrieval for question answering☆215Updated 3 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆182Updated 3 years ago
- Code to reproduce the experiments from the paper.☆101Updated last year
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆557Updated 3 years ago
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆433Updated last month
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- ☆346Updated 3 years ago
- Please see the readme file as well as our 2019 EMNLP paper linked here -->☆211Updated last year
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆449Updated 9 months ago
- Data and models for the SciFact verification task.☆233Updated last year
- Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"☆373Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation☆162Updated last year
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆393Updated last year
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆351Updated last year
- Official repository for "SimpleTOD: A Simple Language Model for Task-Oriented Dialogue"☆238Updated last month
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago