PolyAI-LDN / conversational-datasets
Large datasets for conversational AI
β1,333Updated 5 years ago
Alternatives and similar repositories for conversational-datasets:
Users that are interested in conversational-datasets are comparing it to the libraries listed below
- π¦ State-of-the-Art Conversational AI with Transfer Learningβ1,751Updated last year
- A dataset containing human-human knowledge-grounded open-domain conversations.β647Updated 8 months ago
- A python tool for evaluating the quality of sentence embeddings.β2,105Updated last year
- InferSent sentence embeddingsβ2,284Updated 3 years ago
- β392Updated 2 years ago
- A curated list of resources dedicated to text summarizationβ1,542Updated 2 years ago
- πHMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLPβ1,194Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)β1,212Updated 6 months ago
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,381Updated 2 months ago
- jiant is an nlp toolkitβ1,667Updated last year
- Scikit-learn style model finetuning for NLPβ710Updated 3 weeks ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generatorsβ2,352Updated last year
- Visually Explore the Stanford Question Answering Datasetβ566Updated last year
- A list of datasets/corpora for NLP tasks, in reverse chronological order.β924Updated 5 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://β¦β2,388Updated 3 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, anβ¦β557Updated 3 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.β727Updated last year
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designβ¦β993Updated 3 years ago
- A curated list of pretrained sentence and word embedding modelsβ2,257Updated 4 years ago
- Well tested & Multi-language evaluation framework for text summarization.β623Updated 2 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language modelsβ1,619Updated 2 years ago
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generationβ2,230Updated 8 months ago
- π₯A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAIβ1,507Updated 3 years ago
- Language-Agnostic SEntence Representationsβ3,633Updated 11 months ago
- This repository recorded my NLP journey.β1,078Updated 4 years ago
- β¨Fast Coreference Resolution in spaCy with Neural Networksβ2,873Updated 2 years ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.β1,143Updated last year
- Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)β889Updated 3 months ago
- Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USAβ723Updated 5 years ago
- πA pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etcβ923Updated last year