A collection of large question answering datasets
☆435 · Jul 1, 2024 · Updated last year
Alternatives and similar repositories for large-qa-datasets
Users that are interested in large-qa-datasets are comparing it to the libraries listed below.
- WebQuestions QA Benchmarking Dataset ☆176 · May 27, 2016 · Updated 10 years ago
- ☆11 · May 1, 2022 · Updated 4 years ago
- ☆154 · Aug 21, 2023 · Updated 2 years ago
- Align, a general text alignment function ☆15 · Dec 7, 2023 · Updated 2 years ago
- Enhancing Complex Question Answering over Knowledge Graphs through Evidence Pattern Retrieval, WWW 2024 ☆15 · Oct 22, 2024 · Updated last year
- A curated list of awesome instruction tuning datasets, models, papers and repositories. ☆347 · Jun 12, 2023 · Updated 2 years ago
- This repo contains information about FeB4RAG collection ☆17 · Feb 19, 2024 · Updated 2 years ago
- ☆41 · May 12, 2026 · Updated 2 weeks ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources" ☆21 · Feb 5, 2021 · Updated 5 years ago
- Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table ☆28 · Jun 2, 2023 · Updated 2 years ago
- MoviE Text Audio QA (MetaQA): a benchmark dataset for question answering ☆103 · Oct 10, 2021 · Updated 4 years ago
- A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering ☆49 · Apr 5, 2022 · Updated 4 years ago
- Simple QA over knowledge graphs on DBpedia ☆25 · Oct 31, 2018 · Updated 7 years ago
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22 ☆19 · Jun 23, 2023 · Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow. ☆99 · Mar 14, 2025 · Updated last year
- A BART version of an open-domain QA model in a closed-book setup ☆119 · Aug 13, 2020 · Updated 5 years ago
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering ☆44 · Jun 18, 2022 · Updated 3 years ago
- TSQA: Tabular Scenario Based Question Answering (AAAI 2021) ☆18 · Dec 17, 2020 · Updated 5 years ago
- Materials of public talks given by SJTU X-LANCE members ☆14 · Dec 3, 2022 · Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 5 years ago
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca) ☆1,148 · Jan 4, 2024 · Updated 2 years ago
- ☆16 · Mar 17, 2025 · Updated last year
- ☆41 · Jan 9, 2026 · Updated 4 months ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions" ☆123 · Apr 23, 2022 · Updated 4 years ago
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel ☆24 · Apr 3, 2023 · Updated 3 years ago
- Fusion-in-Decoder ☆593 · Oct 4, 2023 · Updated 2 years ago
- A data set of natural language queries with corresponding SPARQL queries ☆98 · Sep 5, 2023 · Updated 2 years ago
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks. ☆50 · Mar 28, 2023 · Updated 3 years ago
- CUDA, CuDNN, NVIDIA Driver, and PyTorch Installation for Ubuntu ☆12 · Feb 27, 2025 · Updated last year
- MultiSpanQA: A Dataset for Multi-Span Question Answering ☆28 · Jan 24, 2026 · Updated 4 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆2,198 · Oct 16, 2025 · Updated 7 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark. ☆18 · Apr 5, 2025 · Updated last year
- ☆14 · Feb 9, 2022 · Updated 4 years ago
- Mapping of the SimpleQuestions dataset to Wikidata ☆86 · Jun 20, 2021 · Updated 4 years ago
- A simple command line tool to calculate WER for ASR. ☆14 · Oct 14, 2024 · Updated last year
- ☆30 · Sep 5, 2021 · Updated 4 years ago
- MANtIS - a multi-domain information seeking dialogues dataset ☆22 · May 12, 2021 · Updated 5 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- Expanding natural instructions ☆1,046 · Dec 11, 2023 · Updated 2 years ago