alexa / Topical-Chat
A dataset containing human-human knowledge-grounded open-domain conversations.
☆620 Updated last month
Related projects:
- Large datasets for conversational AI ☆1,279 Updated 4 years ago
- The Schema-Guided Dialogue Dataset ☆539 Updated last year
- Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP) ☆847 Updated 3 weeks ago
- Library for Knowledge Intensive Language Tasks ☆905 Updated 2 years ago
- Dialogue model that produces empathetic responses when trained on the EmpatheticDialogues dataset. ☆437 Updated 2 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆685 Updated last year
- Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is design… ☆916 Updated 3 years ago
- Scripts and links to recreate the ELI5 dataset. ☆316 Updated 3 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" ☆335 Updated last year
- ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large c… ☆542 Updated last week
- Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed. ☆708 Updated last year
- 🦄 State-of-the-Art Conversational AI with Transfer Learning ☆1,734 Updated last year
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue ☆280 Updated last year
- Visually Explore the Stanford Question Answering Dataset ☆547 Updated 11 months ago
- This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents. ☆978 Updated 4 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System ☆426 Updated 2 years ago
- A Large Scale Text Summarization Dataset ☆330 Updated 8 months ago
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models. ☆1,128 Updated 6 months ago
- Officially supported AllenNLP models ☆518 Updated last year
- Autoregressive Entity Retrieval ☆756 Updated last year
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆289 Updated 4 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019) ☆202 Updated 3 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… ☆605 Updated 2 years ago
- ☆436 Updated 3 years ago
- ⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System. ☆614 Updated 4 years ago
- A tool for holistic analysis of language generation systems ☆465 Updated 2 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an… ☆545 Updated 2 years ago
- Pre-Trained Models for ToD-BERT ☆288 Updated last year
- Tools to download and cleanup Common Crawl data ☆961 Updated last year