A list of datasets/corpora for NLP tasks, in reverse chronological order.
☆921Jan 4, 2020Updated 6 years ago
Alternatives and similar repositories for nlp-datasets
Users that are interested in nlp-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- chat corpus collection from various open sources☆245Mar 10, 2017Updated 9 years ago
- A script that creates train, valid and test datasets for the ranking task from Ubuntu corpus dialogs.☆667Jul 25, 2023Updated 2 years ago
- ☆831Jul 12, 2017Updated 8 years ago
- Code for the paper "Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning" http://arxiv.org/abs/160…☆235Jun 6, 2018Updated 7 years ago
- 用于训练中英文对话系统的语料库 Datasets for Training Chatbot System☆2,051Sep 23, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)☆5,981Feb 15, 2023Updated 3 years ago
- Dual LSTM Encoder for Dialog Response Generation☆1,575Nov 21, 2022Updated 3 years ago
- Reading Wikipedia to Answer Open-Domain Questions☆4,472Oct 1, 2023Updated 2 years ago
- Summary of deep learning models for dialog systems (Tiancheng Zhao LTI, CMU)☆643Jul 8, 2020Updated 5 years ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,541May 31, 2023Updated 2 years ago
- "End-To-End Memory Networks" in Tensorflow☆825Mar 14, 2017Updated 9 years ago
- Question answering dataset featured in "Teaching Machines to Read and Comprehend☆1,296Apr 26, 2017Updated 9 years ago
- An open-source NLP research library, built on PyTorch.☆11,893Nov 22, 2022Updated 3 years ago
- Natural Language Processing Tasks and References☆3,013Sep 20, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Task generation for testing text understanding and reasoning☆906Mar 27, 2019Updated 7 years ago
- awesome deep learning papers for dialog systems☆52Jun 19, 2017Updated 8 years ago
- A Tensorflow implementation of QANet for machine reading comprehension☆985May 30, 2018Updated 7 years ago
- Memory Networks implementations☆1,756Jul 28, 2020Updated 5 years ago
- My tensorflow implementation of "A neural conversational model", a Deep learning based chatbot☆2,915Dec 30, 2022Updated 3 years ago
- User Simulation for Task-Completion Dialogues☆805May 19, 2023Updated 3 years ago
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,241Nov 8, 2022Updated 3 years ago
- Resources, datasets, papers on Question Answering☆688Mar 17, 2023Updated 3 years ago
- Implementation of Sequence Generative Adversarial Nets with Policy Gradient☆2,094Mar 10, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆619Mar 15, 2017Updated 9 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,340May 1, 2025Updated last year
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆176Mar 26, 2019Updated 7 years ago
- Sequence-to-sequence model with LSTM encoder/decoders and attention☆1,283Dec 30, 2020Updated 5 years ago
- ☆284Sep 14, 2017Updated 8 years ago
- Sent2Vec encoder and training code from the paper "Skip-Thought Vectors"☆2,051Jun 9, 2020Updated 5 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆10,627Nov 3, 2023Updated 2 years ago
- A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)☆404May 21, 2018Updated 7 years ago
- A curated question answering research dataset of factoid questions☆49Nov 9, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repo contains our ACL 2017 paper data and source code☆729Sep 15, 2020Updated 5 years ago
- A dialogue bot for information access☆185Jul 4, 2018Updated 7 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,968Jul 28, 2024Updated last year
- Tutorial on "Practical Neural Networks for NLP: From Theory to Code" at EMNLP 2016☆433Mar 5, 2017Updated 9 years ago
- ☆182Aug 17, 2018Updated 7 years ago
- Summaries and notes on Deep Learning research papers☆4,421Feb 13, 2018Updated 8 years ago
- This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and …☆513Apr 15, 2020Updated 6 years ago