PolyAI-LDN / conversational-datasets
Large datasets for conversational AI
☆1,348 · Updated 5 years ago
Alternatives and similar repositories for conversational-datasets
Users who are interested in conversational-datasets are comparing it to the repositories listed below.
- A dataset containing human-human knowledge-grounded open-domain conversations. ☆654 · Updated 11 months ago
- State-of-the-Art Conversational AI with Transfer Learning ☆1,752 · Updated 2 years ago
- jiant is an nlp toolkit ☆1,670 · Updated 2 years ago
- Language-Agnostic SEntence Representations ☆3,646 · Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models. ☆1,147 · Updated last year
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design… ☆1,022 · Updated 3 years ago
- HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP ☆1,196 · Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,215 · Updated 9 months ago
- Visually Explore the Stanford Question Answering Dataset ☆568 · Updated last year
- ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large c… ☆584 · Updated last week
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆739 · Updated last year
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering. ☆1,755 · Updated last year
- Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ☆1,388 · Updated last month
- Shared repository for open-sourced projects from the Google AI Language team. ☆1,685 · Updated this week
- High-accuracy NLP parser with models for 11 languages. ☆890 · Updated 3 years ago
- The Schema-Guided Dialogue Dataset ☆575 · Updated last year
- Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP) ☆902 · Updated 5 months ago
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020 ☆601 · Updated 5 years ago
- Dialogue model that produces empathetic responses when trained on the EmpatheticDialogues dataset. ☆497 · Updated 3 years ago
- Super easy library for BERT based NLP models ☆1,897 · Updated 10 months ago
- [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System. ☆617 · Updated 5 years ago
- Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed. ☆738 · Updated 2 years ago
- curated collection of papers for the nlp practitioner ☆1,070 · Updated 4 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆670 · Updated last month
- Library for Knowledge Intensive Language Tasks ☆949 · Updated 3 years ago
- ☆393 · Updated 2 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an… ☆557 · Updated 3 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" ☆340 · Updated 8 months ago
- This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents. ☆983 · Updated 4 years ago
- A python tool for evaluating the quality of sentence embeddings. ☆2,107 · Updated last year