We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). The goal of this dataset is to provide a challenging benchmark for question answering quality across a wide set of languages. Please refer to our paper f…
☆192 · Jun 16, 2022 · Updated 3 years ago
Alternatives and similar repositories for ml-mkqa
Users that are interested in ml-mkqa are comparing it to the libraries listed below
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering". ☆80 · Jun 3, 2021 · Updated 4 years ago
- An official repository for MIA 2022 (NAACL 2022 Workshop) Shared Task on Cross-lingual Open-Retrieval Question Answering. ☆31 · Jun 26, 2022 · Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆317 · May 28, 2020 · Updated 5 years ago
- New dataset ☆311 · Aug 31, 2021 · Updated 4 years ago
- Open-Domain Question Answering Goes Conversational via Question Rewriting ☆164 · May 23, 2022 · Updated 3 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets" ☆65 · Aug 31, 2021 · Updated 4 years ago
- Entity-Based Knowledge Conflicts in Question Answering. Code repo for EMNLP2021 paper: https://aclanthology.org/2021.emnlp-main.565/ ☆76 · Aug 29, 2022 · Updated 3 years ago
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022). ☆44 · Dec 25, 2022 · Updated 3 years ago
- Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval ☆43 · Jun 12, 2023 · Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆80 · Feb 16, 2022 · Updated 4 years ago
- A BART version of an open-domain QA model in a closed-book setup ☆119 · Aug 13, 2020 · Updated 5 years ago
- Code for ModularQA ☆28 · Jun 8, 2021 · Updated 4 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty… ☆651 · Jan 4, 2023 · Updated 3 years ago
- The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering". ☆435 · Jul 25, 2024 · Updated last year
- Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset" ☆89 · Nov 16, 2021 · Updated 4 years ago
- scripts for cleaning and creating train/validation/test splits for Thai commonvoice ☆12 · Sep 2, 2021 · Updated 4 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval". ☆28 · Jun 19, 2021 · Updated 4 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation" ☆40 · Jan 2, 2019 · Updated 7 years ago
- Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks. ☆1,860 · Apr 6, 2023 · Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi… ☆49 · Apr 26, 2021 · Updated 4 years ago
- Self-Conditioning Pre-Trained Language Models, ICML 2022 ☆34 · Jul 12, 2022 · Updated 3 years ago
- BERT models for many languages created from Wikipedia texts ☆33 · May 25, 2020 · Updated 5 years ago
- Official repository of the R2-D2 pipeline ☆21 · Nov 16, 2021 · Updated 4 years ago
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design… ☆1,092 · Jul 30, 2021 · Updated 4 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System ☆445 · May 9, 2022 · Updated 3 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆175 · Jun 6, 2021 · Updated 4 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c… ☆14 · May 19, 2020 · Updated 5 years ago
- NLP stuff with quantum computing ☆17 · Nov 9, 2020 · Updated 5 years ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering ☆17 · Jan 7, 2023 · Updated 3 years ago
- Open Thai Wikipedia QA Dataset made by iApp Technology ☆14 · Feb 17, 2021 · Updated 5 years ago
- Library for Knowledge Intensive Language Tasks ☆967 · Mar 31, 2022 · Updated 3 years ago
- Cross-Lingual Machine Reading Comprehension (EMNLP 2019) ☆67 · Nov 6, 2019 · Updated 6 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions" ☆121 · Apr 23, 2022 · Updated 3 years ago
- ACL2020 Tutorial: Open-Domain Question Answering ☆835 · Jan 1, 2021 · Updated 5 years ago
- FaVIQ: Fact Verification from Information-seeking Questions ☆43 · Nov 23, 2022 · Updated 3 years ago