PolyAI-LDN / conversational-datasets
Large datasets for conversational AI
β1,320Updated 5 years ago
Alternatives and similar repositories for conversational-datasets:
Users that are interested in conversational-datasets are comparing it to the libraries listed below
- π¦ State-of-the-Art Conversational AI with Transfer Learningβ1,746Updated last year
- Large-scale pretraining for dialogueβ2,370Updated 2 years ago
- The guide to tackle with the Text Summarizationβ1,297Updated 2 years ago
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,362Updated 2 weeks ago
- A dataset containing human-human knowledge-grounded open-domain conversations.β646Updated 6 months ago
- β¨Fast Coreference Resolution in spaCy with Neural Networksβ2,865Updated last year
- A list of datasets/corpora for NLP tasks, in reverse chronological order.β920Updated 5 years ago
- jiant is an nlp toolkitβ1,661Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)β1,194Updated 4 months ago
- Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)β882Updated last month
- A curated list of pretrained sentence and word embedding modelsβ2,245Updated 3 years ago
- A python tool for evaluating the quality of sentence embeddings.β2,094Updated 11 months ago
- Summary of deep learning models for dialog systems (Tiancheng Zhao LTI, CMU)β651Updated 4 years ago
- β393Updated 2 years ago
- πHMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLPβ1,192Updated last year
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"β337Updated 3 months ago
- High-accuracy NLP parser with models for 11 languages.β880Updated 3 years ago
- Language-Agnostic SEntence Representationsβ3,617Updated 9 months ago
- InferSent sentence embeddingsβ2,285Updated 3 years ago
- β [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.β617Updated 4 years ago
- The Natural Language Decathlon: A Multitask Challenge for NLPβ2,346Updated last year
- A fast, efficient universal vector embedding utility package.β1,642Updated last year
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, anβ¦β555Updated 3 years ago
- Scikit-learn style model finetuning for NLPβ707Updated last week
- A library for Multilingual Unsupervised or Supervised word Embeddingsβ3,202Updated 2 years ago
- The Schema-Guided Dialogue Datasetβ557Updated last year
- Convai2 submission.β294Updated 2 years ago
- Well tested & Multi-language evaluation framework for text summarization.β618Updated 2 years ago
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)β5,833Updated 2 years ago
- Visually Explore the Stanford Question Answering Datasetβ558Updated last year