A list of datasets/corpora for NLP tasks, in reverse chronological order.
☆920Jan 4, 2020Updated 6 years ago
Alternatives and similar repositories for nlp-datasets
Users that are interested in nlp-datasets are comparing it to the libraries listed below
Sorting:
- chat corpus collection from various open sources☆245Mar 10, 2017Updated 8 years ago
- ☆833Jul 12, 2017Updated 8 years ago
- Code for the paper "Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning" http://arxiv.org/abs/160…☆235Jun 6, 2018Updated 7 years ago
- A script that creates train, valid and test datasets for the ranking task from Ubuntu corpus dialogs.☆667Jul 25, 2023Updated 2 years ago
- Summary of deep learning models for dialog systems (Tiancheng Zhao LTI, CMU)☆643Jul 8, 2020Updated 5 years ago
- 用于训练中英文对话系统的语料库 Datasets for Training Chatbot System☆2,052Sep 23, 2020Updated 5 years ago
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)☆5,971Feb 15, 2023Updated 3 years ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,541May 31, 2023Updated 2 years ago
- Dual LSTM Encoder for Dialog Response Generation☆1,577Nov 21, 2022Updated 3 years ago
- Question answering dataset featured in "Teaching Machines to Read and Comprehend☆1,296Apr 26, 2017Updated 8 years ago
- Reading Wikipedia to Answer Open-Domain Questions☆4,476Oct 1, 2023Updated 2 years ago
- Task generation for testing text understanding and reasoning☆906Mar 27, 2019Updated 6 years ago
- Implementation of Sequence Generative Adversarial Nets with Policy Gradient☆2,094Mar 10, 2019Updated 6 years ago
- "End-To-End Memory Networks" in Tensorflow☆827Mar 14, 2017Updated 8 years ago
- ☆619Mar 15, 2017Updated 8 years ago
- Memory Networks implementations☆1,753Jul 28, 2020Updated 5 years ago
- My tensorflow implementation of "A neural conversational model", a Deep learning based chatbot☆2,924Dec 30, 2022Updated 3 years ago
- Natural Language Processing Tasks and References☆3,016Sep 20, 2018Updated 7 years ago
- An open-source NLP research library, built on PyTorch.☆11,889Nov 22, 2022Updated 3 years ago
- Query-Reduction Networks (QRN)☆138Dec 20, 2017Updated 8 years ago
- A Tensorflow implementation of QANet for machine reading comprehension☆983May 30, 2018Updated 7 years ago
- Sequence-to-sequence model with LSTM encoder/decoders and attention☆1,282Dec 30, 2020Updated 5 years ago
- ☆285Sep 14, 2017Updated 8 years ago
- User Simulation for Task-Completion Dialogues☆804May 19, 2023Updated 2 years ago
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,244Nov 8, 2022Updated 3 years ago
- This is an implementation of the Attention Sum Reader model as presented in "Text Comprehension with the Attention Sum Reader Network" av…☆98Sep 9, 2016Updated 9 years ago
- Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"☆2,052Jun 9, 2020Updated 5 years ago
- Gated Attention Reader for Text Comprehension☆190Nov 29, 2017Updated 8 years ago
- Tree-structured Long Short-Term Memory networks (http://arxiv.org/abs/1503.00075)☆896Jul 30, 2017Updated 8 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,340May 1, 2025Updated 10 months ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆175Mar 26, 2019Updated 6 years ago
- Resources, datasets, papers on Question Answering☆687Mar 17, 2023Updated 2 years ago
- ☆182Aug 17, 2018Updated 7 years ago
- A question answering corpus in insurance domain☆464Jan 16, 2017Updated 9 years ago
- Tutorial on "Practical Neural Networks for NLP: From Theory to Code" at EMNLP 2016☆434Mar 5, 2017Updated 8 years ago
- This repo contains our ACL 2017 paper data and source code☆729Sep 15, 2020Updated 5 years ago
- RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is rele…☆492Jul 2, 2019Updated 6 years ago
- Sequence to Sequence Learning with Keras☆3,177Aug 20, 2022Updated 3 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆10,627Nov 3, 2023Updated 2 years ago