A collection of large question answering datasets
☆435Jul 1, 2024Updated last year
Alternatives and similar repositories for large-qa-datasets
Users that are interested in large-qa-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- WebQuestions QA Benchmarking Dataset☆176May 27, 2016Updated 10 years ago
- ☆11May 1, 2022Updated 4 years ago
- ☆155Aug 21, 2023Updated 2 years ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- Enhancing Complex Question Answering over Knowledge Graphs through Evidence Pattern Retrieval, WWW 2024☆15Oct 22, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This repo contains information about FeB4RAG collection☆17Feb 19, 2024Updated 2 years ago
- Open-WikiTable :Dataset for Open Domain Question Answering with Complex Reasoning over Table☆28Jun 2, 2023Updated 3 years ago
- A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering☆49Apr 5, 2022Updated 4 years ago
- simple QA over knowledge graphs on DBpedia☆25Oct 31, 2018Updated 7 years ago
- The official implementation of the paper: H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs☆66Jan 14, 2026Updated 5 months ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆19Jun 23, 2023Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆99Mar 14, 2025Updated last year
- Show the time in Roman Numerals☆11Jan 23, 2020Updated 6 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Aug 13, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering☆43Jun 18, 2022Updated 4 years ago
- TSQA: Tabular Scenario Based Question Answering (AAAI 2021)☆18Dec 17, 2020Updated 5 years ago
- Materials of public talks given By SJTU X-LANCE members☆14Dec 3, 2022Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)☆1,147Jan 4, 2024Updated 2 years ago
- RDT: Russian Distributional Thesaurus (Русский Дистрибутивный Тезаурус)☆30Feb 28, 2019Updated 7 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- ☆43Jan 9, 2026Updated 5 months ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Feb 11, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆123Apr 23, 2022Updated 4 years ago
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel☆24Apr 3, 2023Updated 3 years ago
- Fusion-in-Decoder☆595Oct 4, 2023Updated 2 years ago
- A data set of natural language queries with corresponding SPARQL queries☆98Sep 5, 2023Updated 2 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,215Oct 16, 2025Updated 8 months ago
- MultiSpanQA: A Dataset for Multi-Span Question Answering☆27Jan 24, 2026Updated 4 months ago
- ☆14Feb 9, 2022Updated 4 years ago
- QALD-9-Plus Dataset for Knowledge Graph Question Answering☆29Jun 5, 2024Updated 2 years ago
- Mapping of the SimpleQuestions dataset to Wikidata☆86Jun 20, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- ☆30Sep 5, 2021Updated 4 years ago
- ALBERT Persian Playground☆13Jun 12, 2023Updated 3 years ago
- MANtIS - a multi-domain information seeking dialogues dataset☆22May 12, 2021Updated 5 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Fine-tuning BART on COVID Dialogue Dataset☆17Apr 8, 2020Updated 6 years ago
- NIILC QA data☆18Nov 20, 2015Updated 10 years ago