A list of awesome machine question answering datasets
☆15 · Dec 24, 2019 · Updated 6 years ago
Alternatives and similar repositories for awesome-question-answering-dataset
Users that are interested in awesome-question-answering-dataset are comparing it to the libraries listed below.
- How to convert Chinese Wikipedia data from Simplified to Traditional Chinese and extract the text content into JSON files ☆19 · Aug 5, 2021 · Updated 4 years ago
- ⚙️ Tool for NLP - handles files and text ☆15 · Feb 16, 2025 · Updated last year
- Code for "A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies." ☆27 · Feb 2, 2022 · Updated 4 years ago
- Converted ALBERT Chinese models (for pytorch-transformers) ☆19 · Mar 6, 2020 · Updated 6 years ago
- Collections of Chinese reading comprehension datasets ☆221 · Dec 19, 2019 · Updated 6 years ago
- ☆50 · Feb 13, 2022 · Updated 4 years ago
- 2018 Baidu Machine Reading Comprehension Competition ☆26 · Jul 16, 2018 · Updated 7 years ago
- 🤖📇 Handling multiple NLP tasks in one pipeline ☆57 · Sep 18, 2025 · Updated 6 months ago
- Evaluation framework for open-domain question answering. ☆20 · May 16, 2021 · Updated 4 years ago
- Improving Machine Reading Comprehension with General Reading Strategies ☆37 · Apr 23, 2019 · Updated 6 years ago
- ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion ☆37 · Jul 25, 2024 · Updated last year
- A CLI/package that converts HackMD Markdown to HTML. ☆26 · Oct 27, 2025 · Updated 5 months ago
- 🍳 NLPrep - dataset tool for many natural language processing tasks ☆28 · Jul 30, 2021 · Updated 4 years ago
- Code for ModularQA ☆27 · Jun 8, 2021 · Updated 4 years ago
- Chinese machine reading comprehension datasets ☆65 · Jan 15, 2020 · Updated 6 years ago
- Character Embedding + ESIM + Focal Loss for Chinese Answer Sentence Selection ☆10 · Jan 4, 2020 · Updated 6 years ago
- ACM UMAP2020 Hands-on Tutorial on Data and Algorithmic Bias in Recommender Systems ☆10 · May 23, 2021 · Updated 4 years ago
- Phraseg (一言): a toolkit for new word discovery ☆26 · Nov 30, 2021 · Updated 4 years ago
- Multi-span Style Extraction for Generative Reading Comprehension ☆10 · Apr 2, 2021 · Updated 4 years ago
- 🏃 Hosting NLP models in one line ☆20 · May 8, 2024 · Updated last year
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts? ☆16 · Aug 8, 2023 · Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi… ☆49 · Apr 26, 2021 · Updated 4 years ago
- EMNLP 2021: Detecting Speaker Personas from Conversational Texts ☆13 · Nov 5, 2021 · Updated 4 years ago
- ODSQA: Open-Domain Spoken Question Answering Dataset ☆63 · Feb 20, 2022 · Updated 4 years ago
- Question answering on multiparty dialogue ☆45 · May 28, 2020 · Updated 5 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet" ☆14 · Dec 2, 2020 · Updated 5 years ago
- DuReader BERT Chinese MRC ☆14 · Nov 18, 2022 · Updated 3 years ago
- Comprehensive quantitative comparison of lossless and lossy audio codecs ☆39 · Feb 11, 2023 · Updated 3 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models ☆11 · Apr 9, 2024 · Updated last year
- My PyTorch playground for NLP ☆13 · Sep 20, 2018 · Updated 7 years ago
- Simple chatbot created using Rasa ☆10 · Feb 20, 2021 · Updated 5 years ago
- The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an… ☆12 · Mar 25, 2025 · Updated last year
- ☆15 · May 16, 2024 · Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆28 · Oct 3, 2021 · Updated 4 years ago
- This repository contains datasets (including testing set) for EMNLP-IJCNLP 2019 paper "BiPaR: A Bilingual Parallel Dataset for Multilingu… ☆23 · Jul 13, 2021 · Updated 4 years ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset ☆15 · Apr 7, 2025 · Updated 11 months ago
- A question-answering dataset with a focus on subjective information ☆49 · Jan 8, 2024 · Updated 2 years ago
- Language Understanding Augmentation Toolkit for Robustness Testing ☆20 · Jan 22, 2023 · Updated 3 years ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I… ☆11 · Nov 21, 2023 · Updated 2 years ago