PolyAI-LDN / conversational-datasetsLinks
Large datasets for conversational AI
β1,353Updated 5 years ago
Alternatives and similar repositories for conversational-datasets
Users that are interested in conversational-datasets are comparing it to the libraries listed below
Sorting:
- π¦ State-of-the-Art Conversational AI with Transfer Learningβ1,750Updated 2 years ago
- A dataset containing human-human knowledge-grounded open-domain conversations.β658Updated last year
- jiant is an nlp toolkitβ1,671Updated 2 years ago
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,392Updated 2 months ago
- Large-scale pretraining for dialogueβ2,403Updated 2 years ago
- Language-Agnostic SEntence Representationsβ3,647Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)β1,218Updated 10 months ago
- πHMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLPβ1,195Updated 2 years ago
- A python tool for evaluating the quality of sentence embeddings.β2,108Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.β1,150Updated last year
- β¨Fast Coreference Resolution in spaCy with Neural Networksβ2,881Updated 2 years ago
- β [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.β616Updated 5 years ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.β1,753Updated last year
- Visually Explore the Stanford Question Answering Datasetβ567Updated last year
- InferSent sentence embeddingsβ2,280Updated 3 years ago
- This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.β980Updated 4 years ago
- Super easy library for BERT based NLP modelsβ1,901Updated last year
- Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)β910Updated 7 months ago
- Shared repository for open-sourced projects from the Google AI Language team.β1,698Updated last month
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, anβ¦β561Updated 3 years ago
- A curated list of resources dedicated to text summarizationβ1,543Updated 2 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizatiβ¦β670Updated 2 months ago
- The Schema-Guided Dialogue Datasetβ581Updated 2 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://β¦β2,388Updated 3 years ago
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020β600Updated 5 years ago
- High-accuracy NLP parser with models for 11 languages.β893Updated 3 years ago
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designβ¦β1,032Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.β2,920Updated 2 years ago
- β501Updated 4 years ago
- Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed.β736Updated 2 years ago