longwind48 / convo-miner
Mine conversations from novels in Project Gutenberg, to generate data for data-driven dialogue systems.
☆14Updated 5 years ago
Alternatives and similar repositories for convo-miner:
Users that are interested in convo-miner are comparing it to the libraries listed below
- This is the repository for the Interspeech 2018 paper "Coherence models for dialogue".☆19Updated 5 years ago
- ☆29Updated last year
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆29Updated 4 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆62Updated 2 years ago
- Dataset for Coherent Topic Segmentation and Classification☆35Updated 5 years ago
- Code from the paper "Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity"☆17Updated 4 years ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆50Updated 2 years ago
- Code for our EACL-2021 paper "Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs".☆38Updated 8 months ago
- ☆29Updated 2 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- Code for a web demo of Plan, Write, and Revise: a neural system for interactive open-domain story generation☆33Updated 3 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence☆36Updated last year
- The library that uses dependency parsing to preprocess text to train DisSent model☆33Updated 4 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆20Updated 3 years ago
- A resource to create a multi domain Dialog Act Tagger for conversational agents using publicly available data☆51Updated 3 years ago
- The implementation of the paper "Evaluating Coherence in Dialogue Systems using Entailment"☆74Updated 5 months ago
- Modular implementation of an AM dependency parser in AllenNLP.☆31Updated 8 months ago
- Perspectrum: a dataset of claims, perspectives and evidence documents☆33Updated 5 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 4 years ago
- EMNLP 2020 - Summarizing Text on Any Aspects☆37Updated 4 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 2 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆101Updated 4 years ago
- Code for the paper "Improving Robustness of Machine Translation with Synthetic Noise"☆21Updated 5 years ago
- Word Sense Induction with BERT MLM☆28Updated last year
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆57Updated 2 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆36Updated 4 years ago
- SUM-QE, a BERT-based Summary Quality Estimation Model☆21Updated last year