PolyAI-LDN / conversational-datasets
Large datasets for conversational AI
β1,328Updated 5 years ago
Alternatives and similar repositories for conversational-datasets:
Users that are interested in conversational-datasets are comparing it to the libraries listed below
- Large-scale pretraining for dialogueβ2,381Updated 2 years ago
- π¦ State-of-the-Art Conversational AI with Transfer Learningβ1,749Updated last year
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,372Updated last month
- jiant is an nlp toolkitβ1,663Updated last year
- πHMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLPβ1,193Updated last year
- Language-Agnostic SEntence Representationsβ3,629Updated 10 months ago
- High-accuracy NLP parser with models for 11 languages.β880Updated 3 years ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.β1,141Updated last year
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designβ¦β981Updated 3 years ago
- Papers & presentation materials from Hugging Face's internal science dayβ2,043Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.β2,903Updated 2 years ago
- Super easy library for BERT based NLP modelsβ1,889Updated 7 months ago
- Library for Knowledge Intensive Language Tasksβ935Updated 2 years ago
- A python tool for evaluating the quality of sentence embeddings.β2,100Updated last year
- A dataset containing human-human knowledge-grounded open-domain conversations.β646Updated 7 months ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"β339Updated 4 months ago
- InferSent sentence embeddingsβ2,284Updated 3 years ago
- β¨Fast Coreference Resolution in spaCy with Neural Networksβ2,868Updated last year
- ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large cβ¦β573Updated 2 months ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 tyβ¦β642Updated 2 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://β¦β2,388Updated 3 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizatiβ¦β666Updated last year
- Shared repository for open-sourced projects from the Google AI Language team.β1,660Updated this week
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"β1,628Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)β1,205Updated 5 months ago
- This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.β982Updated 4 years ago
- A Visual Analysis Tool to Explore Learned Representations in Transformers Modelsβ587Updated last year
- β [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.β617Updated 4 years ago
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generationβ2,227Updated 7 months ago
- Minimalist NMT for educational purposesβ690Updated last year