A list of datasets/corpora for NLP tasks, in reverse chronological order.
☆920Jan 4, 2020Updated 6 years ago
Alternatives and similar repositories for nlp-datasets
Users that are interested in nlp-datasets are comparing it to the libraries listed below
Sorting:
- chat corpus collection from various open sources☆245Mar 10, 2017Updated 9 years ago
- A script that creates train, valid and test datasets for the ranking task from Ubuntu corpus dialogs.☆667Jul 25, 2023Updated 2 years ago
- ☆832Jul 12, 2017Updated 8 years ago
- Code for the paper "Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning" http://arxiv.org/abs/160…☆235Jun 6, 2018Updated 7 years ago
- 用于训练中英文对话系统的语料库 Datasets for Training Chatbot System☆2,052Sep 23, 2020Updated 5 years ago
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)☆5,970Feb 15, 2023Updated 3 years ago
- Dual LSTM Encoder for Dialog Response Generation☆1,577Nov 21, 2022Updated 3 years ago
- Reading Wikipedia to Answer Open-Domain Questions☆4,478Oct 1, 2023Updated 2 years ago
- Summary of deep learning models for dialog systems (Tiancheng Zhao LTI, CMU)☆643Jul 8, 2020Updated 5 years ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,540May 31, 2023Updated 2 years ago
- "End-To-End Memory Networks" in Tensorflow☆827Mar 14, 2017Updated 9 years ago
- Question answering dataset featured in "Teaching Machines to Read and Comprehend☆1,297Apr 26, 2017Updated 8 years ago
- An open-source NLP research library, built on PyTorch.☆11,893Nov 22, 2022Updated 3 years ago
- Natural Language Processing Tasks and References☆3,013Sep 20, 2018Updated 7 years ago
- Task generation for testing text understanding and reasoning☆907Mar 27, 2019Updated 6 years ago
- awesome deep learning papers for dialog systems☆52Jun 19, 2017Updated 8 years ago
- A Tensorflow implementation of QANet for machine reading comprehension☆982May 30, 2018Updated 7 years ago
- Memory Networks implementations☆1,753Jul 28, 2020Updated 5 years ago
- My tensorflow implementation of "A neural conversational model", a Deep learning based chatbot☆2,923Dec 30, 2022Updated 3 years ago
- User Simulation for Task-Completion Dialogues☆804May 19, 2023Updated 2 years ago
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,244Nov 8, 2022Updated 3 years ago
- Resources, datasets, papers on Question Answering☆687Mar 17, 2023Updated 3 years ago
- Implementation of Sequence Generative Adversarial Nets with Policy Gradient☆2,094Mar 10, 2019Updated 7 years ago
- ☆619Mar 15, 2017Updated 9 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,339May 1, 2025Updated 10 months ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆175Mar 26, 2019Updated 6 years ago
- Sequence-to-sequence model with LSTM encoder/decoders and attention☆1,282Dec 30, 2020Updated 5 years ago
- ☆285Sep 14, 2017Updated 8 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆10,628Nov 3, 2023Updated 2 years ago
- Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"☆2,051Jun 9, 2020Updated 5 years ago
- A curated question answering research dataset of factoid questions☆49Nov 9, 2019Updated 6 years ago
- A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)☆405May 21, 2018Updated 7 years ago
- This repo contains our ACL 2017 paper data and source code☆730Sep 15, 2020Updated 5 years ago
- A dialogue bot for information access☆185Jul 4, 2018Updated 7 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,975Jul 28, 2024Updated last year
- Tutorial on "Practical Neural Networks for NLP: From Theory to Code" at EMNLP 2016☆434Mar 5, 2017Updated 9 years ago
- ☆182Aug 17, 2018Updated 7 years ago
- Summaries and notes on Deep Learning research papers☆4,418Feb 13, 2018Updated 8 years ago
- A question answering corpus in insurance domain☆464Jan 16, 2017Updated 9 years ago