apple/ml-mkqa

We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). The goal of this dataset is to provide a challenging benchmark for question answering quality across a wide set of languages. Please refer to our paper f…

☆193

Alternatives and similar repositories for ml-mkqa

Users who are interested in ml-mkqa are comparing it to the libraries listed below.

mia-workshop / MIA-Shared-Task-2022
An official repository for MIA 2022 (NAACL 2022 Workshop) Shared Task on Cross-lingual Open-Retrieval Question Answering.
☆31 · Jun 26, 2022 · Updated 4 years ago
AkariAsai / XORQA
This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".
☆80 · Jun 3, 2021 · Updated 5 years ago
google-deepmind / xquad
☆210 · Nov 12, 2021 · Updated 4 years ago
apple / ml-qrecc
Open-Domain Question Answering Goes Conversational via Question Rewriting
☆167 · May 23, 2022 · Updated 4 years ago
facebookresearch / MLQA
New dataset
☆311 · Aug 31, 2021 · Updated 4 years ago
apple / ml-knowledge-conflicts
Entity-Based Knowledge Conflicts in Question Answering. Code repo for EMNLP2021 paper: https://aclanthology.org/2021.emnlp-main.565/
☆77 · Aug 29, 2022 · Updated 3 years ago
google-research-datasets / tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆319 · May 28, 2020 · Updated 6 years ago
apple / ml-selfcond
Self-Conditioning Pre-Trained Language Models, ICML 2022
☆34 · Jul 12, 2022 · Updated 4 years ago
facebookresearch / QA-Overlap
Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"
☆66 · Aug 31, 2021 · Updated 4 years ago
castorini / mr.tydi
Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.
☆83 · Feb 16, 2022 · Updated 4 years ago
AkariAsai / extractive_rc_by_runtime_mt
Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"
☆40 · Jan 2, 2019 · Updated 7 years ago
AkariAsai / evidentiality_qa
The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).
☆44 · Dec 25, 2022 · Updated 3 years ago
xwhan / ProQA
Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval
☆43 · Jun 12, 2023 · Updated 3 years ago
google-research / xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆651 · Jan 4, 2023 · Updated 3 years ago
AkariAsai / learning_to_retrieve_reasoning_paths
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
☆436 · Jul 25, 2024 · Updated last year
AkariAsai / unanswerable_qa
The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".
☆28 · Jun 19, 2021 · Updated 5 years ago
shmsw25 / bart-closed-book-qa
A BART version of an open-domain QA model in a closed-book setup
☆118 · Aug 13, 2020 · Updated 5 years ago
ekapolc / Thai_commonvoice_split
scripts for cleaning and creating train/validation/test splits for Thai commonvoice
☆12 · Sep 2, 2021 · Updated 4 years ago
facebookresearch / reconsider
ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…
☆50 · Apr 26, 2021 · Updated 5 years ago
thunlp / XQA
Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset"
☆89 · Nov 16, 2021 · Updated 4 years ago
TurkuNLP / wikibert
BERT models for many languages created from Wikipedia texts
☆33 · May 25, 2020 · Updated 6 years ago
google-research-datasets / natural-questions
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design…
☆1,135 · Jul 30, 2021 · Updated 4 years ago
facebookresearch / DPR
Dense Passage Retriever (DPR) is a set of tools and models for the open-domain Q&A task.
☆1,869 · Apr 6, 2023 · Updated 3 years ago
unicamp-dl / mMARCO
A multilingual version of MS MARCO passage ranking dataset
☆148 · Oct 19, 2023 · Updated 2 years ago
ModuNLP / hacking_transformers
☆11 · Aug 12, 2020 · Updated 5 years ago
shmsw25 / qa-hard-em
An original implementation of EMNLP 2019, "A Discrete Hard EM Approach for Weakly Supervised Question Answering"
☆134 · Jul 3, 2020 · Updated 6 years ago
iapp-technology / iapp-wiki-qa-dataset
Open Thai Wikipedia QA Dataset made by iApp Technology
☆14 · Feb 17, 2021 · Updated 5 years ago
lilt / tec
Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022].
☆19 · Dec 9, 2022 · Updated 3 years ago
allenai / modularqa
Code for ModularQA
☆27 · Jun 8, 2021 · Updated 5 years ago
jeffeuxMartin / meta-learning-hlp
A publishing website of a table collecting meta-learning-related papers in the area of human language processing.
☆17 · Aug 2, 2021 · Updated 4 years ago
shmsw25 / Channel-LM-Prompting
An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"
☆130 · Apr 23, 2022 · Updated 4 years ago
google / retrieval-qa-eval
☆42 · Sep 25, 2019 · Updated 6 years ago
allenai / unifiedqa
UnifiedQA: Crossing Format Boundaries With a Single QA System
☆442 · May 9, 2022 · Updated 4 years ago
facebookresearch / KILT
Library for Knowledge Intensive Language Tasks
☆978 · Mar 31, 2022 · Updated 4 years ago
facebookresearch / evaluation-of-nmt-bt
This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets …
☆15 · Aug 31, 2021 · Updated 4 years ago
vzhong / e3
Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.
☆48 · Jul 22, 2023 · Updated 3 years ago
facebookresearch / romqa
A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
☆18 · Jan 7, 2023 · Updated 3 years ago
nyu-mll / pretraining-learning-curves
The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"
☆21 · Nov 10, 2020 · Updated 5 years ago
faviq / faviq
FaVIQ: Fact Verification from Information-seeking Questions
☆43 · Nov 23, 2022 · Updated 3 years ago