ad-freiburg/large-qa-datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ad-freiburg/large-qa-datasets)

ad-freiburg / large-qa-datasets

A collection of large question answering datasets

☆437

Alternatives and similar repositories for large-qa-datasets

Users that are interested in large-qa-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JasonForJoy / BRIEF
View on GitHub
ACL 2026 & NAACL 2025: Bridging Retrieval and Inference through Evidence Fusion
☆14Apr 9, 2026Updated 3 months ago
YisiSang / TVSHOWGUESS
View on GitHub
☆11May 1, 2022Updated 4 years ago
Alab-NII / 2wikimultihop
View on GitHub
☆158Aug 21, 2023Updated 2 years ago
yuh-zha / Align
View on GitHub
Align, a general text alignment function
☆15Dec 7, 2023Updated 2 years ago
nju-websoft / EPR-KGQA
View on GitHub
Enhancing Complex Question Answering over Knowledge Graphs through Evidence Pattern Retrieval, WWW 2024
☆15Oct 22, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mingdachen / WikiTableT
View on GitHub
Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources"
☆21Feb 5, 2021Updated 5 years ago
sean0042 / Open_WikiTable
View on GitHub
Open-WikiTable :Dataset for Open Domain Question Answering with Complex Reasoning over Table
☆28Jun 2, 2023Updated 3 years ago
yuyuz / MetaQA
View on GitHub
MoviE Text Audio QA (MetaQA): a benchmark dataset for question answering
☆103Oct 10, 2021Updated 4 years ago
mhardalov / exams-qa
View on GitHub
A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering
☆49Apr 5, 2022Updated 4 years ago
castorini / SimpleDBpediaQA
View on GitHub
simple QA over knowledge graphs on DBpedia
☆25Oct 31, 2018Updated 7 years ago
StonyBrookNLP / teabreac
View on GitHub
Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22
☆19Jun 23, 2023Updated 3 years ago
esdurmus / Wikilingua
View on GitHub
Multilingual abstractive summarization dataset extracted from WikiHow.
☆99Mar 14, 2025Updated last year
ryannair05 / Tempus-Romanum
View on GitHub
Show the time in Roman Numerals
☆12Jan 23, 2020Updated 6 years ago
shmsw25 / bart-closed-book-qa
View on GitHub
A BART version of an open-domain QA model in a closed-book setup
☆118Aug 13, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
KGQA / KGQA-datasets
View on GitHub
This repository is a collection of existing KGQA datasets in the form of the 🤗 huggingface datasets -> https://github.com/huggingface/d…
☆116Jan 8, 2024Updated 2 years ago
princeton-nlp / EvalConvQA
View on GitHub
[ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering
☆43Jun 18, 2022Updated 4 years ago
nju-websoft / TSQA
View on GitHub
TSQA: Tabular Scenario Based Question Answering (AAAI 2021)
☆18Dec 17, 2020Updated 5 years ago
X-LANCE / public_talks
View on GitHub
Materials of public talks given By SJTU X-LANCE members
☆14Dec 3, 2022Updated 3 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
yaodongC / awesome-instruction-dataset
View on GitHub
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
☆1,150Jan 4, 2024Updated 2 years ago
nlpub / rdt
View on GitHub
RDT: Russian Distributional Thesaurus (Русский Дистрибутивный Тезаурус)
☆30Feb 28, 2019Updated 7 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
ckosten / Spider4SPARQL
View on GitHub
☆16Mar 17, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ag2ai / SimpleDoc
View on GitHub
☆41Jan 9, 2026Updated 6 months ago
xiye17 / TextualExplInContext
View on GitHub
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)
☆16Feb 11, 2023Updated 3 years ago
shmsw25 / AmbigQA
View on GitHub
An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"
☆123Apr 23, 2022Updated 4 years ago
gigio1023 / alpaca-lora-for-huggingface
View on GitHub
Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel
☆24Apr 3, 2023Updated 3 years ago
RenzeLou / Datasets-for-Question-Answering
View on GitHub
Will be updated continuously.
☆40Mar 2, 2022Updated 4 years ago
Spico197 / awesome-lm-evaluation
View on GitHub
🩺 A collection of ChatGPT evaluation reports on various bechmarks.
☆50Mar 28, 2023Updated 3 years ago
Kabir5296 / CUDA-CuDNN-Setup-for-Ubuntu-Guide
View on GitHub
CUDA, CuDNN, NVIDIA Driver, and PyTorch Installation for Ubuntu
☆12Feb 27, 2025Updated last year
LibrAIResearch / libra-eval
View on GitHub
☆23May 20, 2025Updated last year
beir-cellar / beir
View on GitHub
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆2,232Oct 16, 2025Updated 8 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Harry-Chan / seq2seqlm-on-qg
View on GitHub
☆13Feb 9, 2022Updated 4 years ago
KGQA / QALD_9_plus
View on GitHub
QALD-9-Plus Dataset for Knowledge Graph Question Answering
☆29Jun 5, 2024Updated 2 years ago
askplatypus / wikidata-simplequestions
View on GitHub
Mapping of the SimpleQuestions dataset to Wikidata
☆86Jun 20, 2021Updated 5 years ago
pkufool / simple-wer
View on GitHub
A simple command line tool to calculate WER for ASR.
☆14Oct 14, 2024Updated last year
m3hrdadfi / albert-persian-lab
View on GitHub
ALBERT Persian Playground
☆13Jun 12, 2023Updated 3 years ago
orhonovich / q-squared
View on GitHub
☆30Sep 5, 2021Updated 4 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago