dlatk / happierfuntokenizing
This code implements a basic, Twitter-aware tokenizer.
☆12Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for happierfuntokenizing
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings"☆66Updated 2 years ago
- Sentence specificity prediction☆25Updated 5 years ago
- ☆40Updated 4 years ago
- ☆17Updated 6 years ago
- This package supports implementation of anchor-based topic modeling and variants of the anchoring algorithm in Python 3.☆16Updated 6 years ago
- Driver for LIWC2015 analysis. LIWC2015 dictionary not included.☆16Updated last year
- Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"☆31Updated 4 years ago
- A set of media framing annotations, along with scripts for obtaining the corresponding news articles☆49Updated 5 years ago
- annotated hateful speech☆25Updated 5 years ago
- ☆16Updated 5 years ago
- ☆11Updated 4 years ago
- The COVID-19 Real World Worry Datasets☆26Updated 2 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆28Updated 6 years ago
- This repository implements models described in ''Interpretale Word Embeddings via Informative Priors''☆9Updated 5 years ago
- Quick implementation of Monroe et al.'s algorithm for comparing languages☆48Updated 4 years ago
- Harassment Lexicon and Corpus☆27Updated 6 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆56Updated last year
- Code for FACTOID dataset paper in LREC 2022☆15Updated last year
- A framework to identify relations between ideas in temporal text corpora.☆29Updated 6 years ago
- ☆17Updated 3 years ago
- Metaphor dataset: literal versus non-literal uses of words☆14Updated 9 years ago
- ☆15Updated 7 years ago
- This package consists of functionalities for dynamic topic modelling and its visualization☆24Updated 4 years ago
- Anchor-based topic modeling☆10Updated 4 years ago
- dynamic topic modeling☆39Updated last year
- Training Temporal Word Embeddings with a Compass☆64Updated last year
- Additional material for the paper "MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations Prediction"☆53Updated last year
- A python package for the Linguistic Inquiry and Word Count (LIWC) dictionary.☆37Updated 3 years ago
- A data set regarding news veracity on social media. Published at ICWSM-18.☆32Updated 3 years ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Updated 4 years ago