google-research-datasets / tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the training and evaluation of automatic question answering systems. This repository provides evaluation code and a baseline system for the dataset.
☆291Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for tydiqa
- New dataset☆299Updated 3 years ago
- Unsupervised Question answering via Cloze Translation☆218Updated 2 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆281Updated last year
- Interpretable Evaluation for (Almost) All NLP Tasks☆193Updated 2 years ago
- Scripts and links to recreate the ELI5 dataset.☆319Updated 3 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- ☆181Updated 3 years ago
- Please see the readme file as well as our 2019 EMNLP paper linked here -->☆196Updated 6 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆351Updated last year
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆199Updated last year
- Officially supported AllenNLP models☆528Updated last year
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆129Updated 9 months ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"☆336Updated last week
- ToTTo is an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: giv…☆436Updated 2 months ago
- A collection of task-specific NLU datasets☆146Updated 2 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆176Updated 3 months ago
- PyTorch original implementation of "Unsupervised Question Decomposition for Question Answering"☆120Updated last year
- ☆150Updated 5 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆555Updated 2 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆166Updated 2 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 2 years ago
- A tool for holistic analysis of language generations systems☆467Updated 2 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)☆204Updated 3 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆631Updated last year
- Code to reproduce the experiments from the paper.☆101Updated last year
- The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL).☆220Updated 3 months ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆228Updated last year
- Neural Question Generation using the SQuAD and NewsQA datasets☆109Updated last year
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆285Updated last year
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago