CornellNLP/ConvoKit

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.

☆640

Alternatives and similar repositories for ConvoKit

Users who are interested in ConvoKit are comparing it to the libraries listed below.

myeomans / politeness
☆26 · Mar 12, 2026 · Updated 4 months ago
johnwdubois / rezonator
Rezonator: Dynamics of human engagement
☆34 · Jul 8, 2026 · Updated 2 weeks ago
jmhessel / FightingWords
Quick implementation of Monroe et al.'s algorithm for comparing languages
☆57 · Jun 15, 2020 · Updated 6 years ago
ddemszky / framing-twitter
Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings"
☆70 · Sep 30, 2022 · Updated 3 years ago
cosanlab / neighbors
A package to perform collaborative filtering on emotion datasets.
☆11 · Jan 8, 2024 · Updated 2 years ago
cgpotts / swda
Switchboard Dialog Act Corpus with Penn Treebank links
☆150 · Dec 30, 2020 · Updated 5 years ago
PolyAI-LDN / conversational-datasets
Large datasets for conversational AI
☆1,402 · Nov 16, 2019 · Updated 6 years ago
Ejhfast / empath-client
analyze text with empath
☆348 · Apr 22, 2017 · Updated 9 years ago
allenai / smashed
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…
☆35 · May 24, 2024 · Updated 2 years ago
alexa / dialoglue
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
☆288 · Jul 6, 2023 · Updated 3 years ago
nwrush / Visualization
☆11 · Dec 1, 2017 · Updated 8 years ago
MilaNLProc / contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…
☆1,272 · Jul 24, 2025 · Updated last year
facebookresearch / EmpatheticDialogues
Dialogue model that produces empathetic responses when trained on the EmpatheticDialogues dataset.
☆555 · Dec 3, 2021 · Updated 4 years ago
emorynlp / character-mining
Mining individual characters in multiparty dialogue
☆177 · Aug 21, 2023 · Updated 2 years ago
BoulderDS / feature-importance
☆16 · Jul 6, 2023 · Updated 3 years ago
nickduran / align-linguistic-alignment
Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpo…
☆55 · Jun 29, 2026 · Updated 3 weeks ago
UniversalAnaphora / UniversalAnaphora
An initiative to collect and distribute resources for co-reference resolution in a unified standard.
☆26 · May 12, 2024 · Updated 2 years ago
ryanjgallagher / shifterator
Interpretable data visualizations for understanding how texts differ at the word level
☆290 · Jun 30, 2026 · Updated 3 weeks ago
tslmy / politeness-estimator
A set of pre-trained machine-learning models that predict (im-)politeness scores in texts.
☆19 · Jan 2, 2025 · Updated last year
ropenscilabs / tif
Text Interchange Formats
☆38 · Nov 26, 2023 · Updated 2 years ago
seinan9 / LSCDiscovery
Scripts for large-scale prediction of lexical semantic change.
☆14 · Feb 9, 2023 · Updated 3 years ago
JasonKessler / scattertext
Beautiful visualizations of how language differs among document types.
☆2,338 · Jul 4, 2026 · Updated 2 weeks ago
booknlp / booknlp
BookNLP, a natural language processing pipeline for books
☆927 · Jul 31, 2024 · Updated last year
ddangelov / Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
☆3,103 · Nov 14, 2024 · Updated last year
kenlimmj / fightin-words
A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.
☆11 · May 26, 2026 · Updated last month
Mahhos / Empathy
Fine grained Empathy Direction Detection
☆16 · Dec 11, 2020 · Updated 5 years ago
alexa / Topical-Chat
A dataset containing human-human knowledge-grounded open-domain conversations.
☆673 · Aug 2, 2024 · Updated last year
behavioral-data / Cognitive-Reframing
Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts
☆67 · Sep 12, 2023 · Updated 2 years ago
networkdynamics / humanizr
☆32 · Jul 6, 2015 · Updated 11 years ago
cschwem2er / facerec
An Interface for Face Recognition in R
☆35 · Jun 18, 2019 · Updated 7 years ago
schochastics / PSAWR
R package to interact with the Pushshift.io API
☆10 · Aug 4, 2025 · Updated 11 months ago
kgjerde / corporaexplorer
An R package for dynamic exploration of text collections
☆65 · Mar 6, 2026 · Updated 4 months ago
vene / marseille
Mining Argument Structures with Expressive Inference (Linear and LSTM Engines)
☆66 · Aug 1, 2017 · Updated 8 years ago
WolfNiu / polite-dialogue-generation
Code for "Polite Dialogue Generation Without Parallel Data"
☆25 · Nov 24, 2018 · Updated 7 years ago
declare-lab / dialogue-understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empiric…
☆128 · Mar 14, 2023 · Updated 3 years ago
kbenoit / quanteda.dictionaries
Dictionaries for text analysis
☆76 · Jan 31, 2023 · Updated 3 years ago
trinker / lexicon
A data package containing lexicons and dictionaries for text analysis
☆113 · Oct 12, 2021 · Updated 4 years ago
thilomichael / retico
An open-source framework for modeling real-time conversations in spoken dialogue systems.
☆27 · Aug 12, 2022 · Updated 3 years ago
ddemszky / conversational-uptake
Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions"
☆25 · Apr 24, 2025 · Updated last year