CornellNLP / ConvoKit
ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.
☆578 Updated 3 months ago
Alternatives and similar repositories for ConvoKit:
Users who are interested in ConvoKit are comparing it to the libraries listed below.
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities. ☆352 Updated 2 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ☆1,227 Updated 2 months ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆731 Updated 8 months ago
- Analyze text with Empath ☆328 Updated 8 years ago
- High-accuracy NLP parser with models for 11 languages. ☆880 Updated 3 years ago
- A dataset containing human-human knowledge-grounded open-domain conversations. ☆647 Updated 8 months ago
- Collection of tools for building diachronic/historical word vectors ☆429 Updated last year
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020) ☆590 Updated 9 months ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆668 Updated last year
- ☆166 Updated 2 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus ☆147 Updated 2 years ago
- Python package of Tomoto, the Topic Modeling Tool ☆578 Updated 8 months ago
- This repository contains EmoBank, a large-scale text corpus manually annotated with emotion according to the psychological Valence-Arousa… ☆204 Updated 2 years ago
- A module to compute textual lexical richness (aka lexical diversity). ☆106 Updated last year
- A reading list of up-to-date papers on NLP for Social Good. ☆300 Updated last year
- Topic Modeling in Embedding Spaces ☆554 Updated last year
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper ☆391 Updated 10 months ago
- Language, Knowledge, Cognition ☆601 Updated 2 months ago
- The Schema-Guided Dialogue Dataset ☆565 Updated last year
- Dialogue model that produces empathetic responses when trained on the EmpatheticDialogues dataset. ☆484 Updated 3 years ago
- Package to extract connotation frames ☆85 Updated last year
- A Python wrapper around the topic modeling functions of MALLET. ☆101 Updated 5 months ago
- A frame-semantic parsing system based on a softmax-margin SegRNN. ☆232 Updated 2 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ☆1,383 Updated 2 months ago
- Pipeline to generate the Standardized Project Gutenberg Corpus ☆177 Updated last year
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx ☆632 Updated 4 years ago
- This is a simple Python package for calculating a variety of lexical diversity indices ☆75 Updated last year
- A Python library for calculating a large variety of metrics from text ☆337 Updated 4 months ago
- Codebase for testing whether hidden states of neural networks encode discrete structures. ☆392 Updated last year
- Catalog of abusive language data (PLoS 2020) ☆309 Updated 10 months ago