A list of datasets/corpora for NLP tasks, in reverse chronological order.
☆920Jan 4, 2020Updated 6 years ago
Alternatives and similar repositories for nlp-datasets
Users that are interested in nlp-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- chat corpus collection from various open sources☆245Mar 10, 2017Updated 9 years ago
- A script that creates train, valid and test datasets for the ranking task from Ubuntu corpus dialogs.☆667Jul 25, 2023Updated 2 years ago
- ☆831Jul 12, 2017Updated 8 years ago
- Code for the paper "Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning" http://arxiv.org/abs/160…☆235Jun 6, 2018Updated 7 years ago
- 用于训练中英文对话系统的语料库 Datasets for Training Chatbot System☆2,051Sep 23, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)☆5,971Feb 15, 2023Updated 3 years ago
- Dual LSTM Encoder for Dialog Response Generation☆1,576Nov 21, 2022Updated 3 years ago
- Reading Wikipedia to Answer Open-Domain Questions☆4,477Oct 1, 2023Updated 2 years ago
- Summary of deep learning models for dialog systems (Tiancheng Zhao LTI, CMU)☆643Jul 8, 2020Updated 5 years ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,542May 31, 2023Updated 2 years ago
- "End-To-End Memory Networks" in Tensorflow☆826Mar 14, 2017Updated 9 years ago
- Question answering dataset featured in "Teaching Machines to Read and Comprehend☆1,297Apr 26, 2017Updated 8 years ago
- An open-source NLP research library, built on PyTorch.☆11,893Nov 22, 2022Updated 3 years ago
- Natural Language Processing Tasks and References☆3,012Sep 20, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Task generation for testing text understanding and reasoning☆907Mar 27, 2019Updated 7 years ago
- awesome deep learning papers for dialog systems☆52Jun 19, 2017Updated 8 years ago
- A Tensorflow implementation of QANet for machine reading comprehension☆985May 30, 2018Updated 7 years ago
- Memory Networks implementations☆1,756Jul 28, 2020Updated 5 years ago
- My tensorflow implementation of "A neural conversational model", a Deep learning based chatbot☆2,917Dec 30, 2022Updated 3 years ago
- User Simulation for Task-Completion Dialogues☆805May 19, 2023Updated 2 years ago
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,243Nov 8, 2022Updated 3 years ago
- Resources, datasets, papers on Question Answering☆687Mar 17, 2023Updated 3 years ago
- Implementation of Sequence Generative Adversarial Nets with Policy Gradient☆2,095Mar 10, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆619Mar 15, 2017Updated 9 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,339May 1, 2025Updated 11 months ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆175Mar 26, 2019Updated 7 years ago
- Sequence-to-sequence model with LSTM encoder/decoders and attention☆1,282Dec 30, 2020Updated 5 years ago
- ☆285Sep 14, 2017Updated 8 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆10,631Nov 3, 2023Updated 2 years ago
- Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"☆2,051Jun 9, 2020Updated 5 years ago
- A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)☆404May 21, 2018Updated 7 years ago
- A curated question answering research dataset of factoid questions☆49Nov 9, 2019Updated 6 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- This repo contains our ACL 2017 paper data and source code☆730Sep 15, 2020Updated 5 years ago
- A dialogue bot for information access☆185Jul 4, 2018Updated 7 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,973Jul 28, 2024Updated last year
- Tutorial on "Practical Neural Networks for NLP: From Theory to Code" at EMNLP 2016☆433Mar 5, 2017Updated 9 years ago
- ☆182Aug 17, 2018Updated 7 years ago
- Summaries and notes on Deep Learning research papers☆4,419Feb 13, 2018Updated 8 years ago
- This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and …☆509Apr 15, 2020Updated 5 years ago