voidful/awesome-question-answering-dataset

A list of awesome machine question answering datasets

☆15

Alternatives and similar repositories for awesome-question-answering-dataset

Users who are interested in awesome-question-answering-dataset are comparing it to the repositories listed below.

voidful / nlp2
⚙️ Tool for NLP: handles files and text
☆15 · Feb 16, 2025 · Updated last year
voidful / BDG
Code for "A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies."
☆27 · Feb 2, 2022 · Updated 4 years ago
p208p2002 / albert-zh-for-pytorch-transformers
Converted ALBERT Chinese models (for pytorch-transformers)
☆19 · Mar 6, 2020 · Updated 6 years ago
sonos / spoken-language-understanding-research-datasets
☆50 · Feb 13, 2022 · Updated 4 years ago
jingyihiter / myDuReader
2018 Baidu Machine Reading Comprehension Competition
☆26 · Jul 16, 2018 · Updated 8 years ago
soco-ai / SF-QA
Evaluation framework for open-domain question answering.
☆20 · May 16, 2021 · Updated 5 years ago
nlpdata / strategy
Improving Machine Reading Comprehension with General Reading Strategies
☆37 · Apr 23, 2019 · Updated 7 years ago
benywon / ReCO
ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion
☆37 · Jul 25, 2024 · Updated 2 years ago
ksw2000 / hackmd-to-html-cli
A CLI/package that converts HackMD Markdown to HTML.
☆26 · Jun 15, 2026 · Updated last month
zsweet / zsw_AI_model
☆12 · Sep 25, 2018 · Updated 7 years ago
voidful / NLPrep
🍳 NLPrep - dataset tool for many natural language processing tasks
☆28 · Jul 30, 2021 · Updated 4 years ago
allenai / modularqa
Code for ModularQA
☆27 · Jun 8, 2021 · Updated 5 years ago
UDICatNCHU / UdicOpenData
Public sentiment training data
☆58 · Mar 7, 2023 · Updated 3 years ago
Harry-Chan / seq2seqlm-on-qg
☆13 · Feb 9, 2022 · Updated 4 years ago
junzeng-pluto / ChineseSquad
Chinese machine reading comprehension dataset
☆65 · Jan 15, 2020 · Updated 6 years ago
p208p2002 / Transformer-QG-on-SQuAD
Implement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.)
☆50 · Sep 20, 2022 · Updated 3 years ago
SimpleVQA / SimpleVQA
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
☆15 · Feb 20, 2025 · Updated last year
AkariAsai / unanswerable_qa
The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".
☆28 · Jun 19, 2021 · Updated 5 years ago
wbopan / mstar
mstar: Optimizing memory architecture for every LLM task as executable Python code.
☆15 · Jun 12, 2026 · Updated last month
zsweet / Design-and-Implementation-of-Online-Judge-Semantic-Checking-System-Based-on-Abstract-Syntax-Tree
Design and Implementation of Online Judge Semantic Checking System Based on Abstract Syntax Tree
☆13 · Aug 25, 2017 · Updated 8 years ago
bwanglzu / Maximal-Marginal-Relevance
MMR for information retrieval
☆18 · Sep 22, 2017 · Updated 8 years ago
voidful / llm-codec
LLM-Codec: Neural Audio Codec Meets Language Model Objectives
☆23 · May 3, 2026 · Updated 2 months ago
biasinrecsys / umap2020
ACM UMAP2020 Hands-on Tutorial on Data and Algorithmic Bias in Recommender Systems
☆10 · May 23, 2021 · Updated 5 years ago
nreimers / beir-sparta
Re-implementation of the SPARTA model
☆13 · Oct 1, 2021 · Updated 4 years ago
voidful / Phraseg
Phraseg - 一言: new word discovery toolkit
☆26 · Nov 30, 2021 · Updated 4 years ago
voidful / nlp2go
🏃 hosting nlp models in one line
☆20 · May 8, 2024 · Updated 2 years ago
luciusssss / why-learn-shortcut
[ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?
☆16 · Aug 8, 2023 · Updated 2 years ago
facebookresearch / reconsider
ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…
☆50 · Apr 26, 2021 · Updated 5 years ago
moporgic / TDL2048-Demo
Temporal Difference Learning for the Game of 2048 (Demo)
☆17 · May 10, 2024 · Updated 2 years ago
emorynlp / FriendsQA
Question answering on multiparty dialogue
☆45 · May 28, 2020 · Updated 6 years ago
Chia-Hsuan-Lee / ODSQA
ODSQA: Open-Domain Spoken Question Answering Dataset
☆64 · Feb 20, 2022 · Updated 4 years ago
cofacts / opendata
Open data from the Cofacts collaborative fact-checking database
☆51 · Updated this week
cmpute / audio-codec-benchmark
Comprehensive quantitative comparison of lossless and lossy audio codecs
☆41 · Feb 11, 2023 · Updated 3 years ago
ga642381 / Spoken-Dialogue-Model-Survey
A survey of spoken dialogue models (SDMs) with speech input and speech output. Focus on their Intermediate Representation and Generation …
☆31 · Mar 24, 2026 · Updated 4 months ago
THU-KEG / R-Eval
[KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
☆11 · Apr 9, 2024 · Updated 2 years ago
bitswired / semantic-splitting-tutorial
☆15 · May 16, 2024 · Updated 2 years ago
megagonlabs / SubjQA
A question-answering dataset with a focus on subjective information
☆49 · Jan 8, 2024 · Updated 2 years ago
isjwdu / DFADD
Official implementation and dataset for the paper "DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset"
☆16 · Apr 7, 2025 · Updated last year
thu-coai / LAUG
Language Understanding Augmentation Toolkit for Robustness Testing
☆20 · Jan 22, 2023 · Updated 3 years ago