dlatk / happierfuntokenizingLinks
This code implements a basic, Twitter-aware tokenizer.
☆12Updated last year
Alternatives and similar repositories for happierfuntokenizing
Users that are interested in happierfuntokenizing are comparing it to the libraries listed below
Sorting:
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings"☆69Updated 3 years ago
- A data set regarding news veracity on social media. Published at ICWSM-18.☆36Updated 4 years ago
- Quick implementation of Monroe et al.'s algorithm for comparing languages☆53Updated 5 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 4 years ago
- Text-Based Ideal Points☆45Updated 2 years ago
- A set of media framing annotations, along with scripts for obtaining the corresponding news articles☆54Updated 6 years ago
- ☆16Updated 2 years ago
- Sentence specificity prediction☆26Updated 6 years ago
- Data to accompany the ICWSM 2015 paper "CREDBANK: A Large-scale Social Media Corpus With Associated Credibility Annotations"☆45Updated 6 years ago
- This repository implements models described in ''Interpretale Word Embeddings via Informative Priors''☆11Updated 6 years ago
- Topic Modeling for The New York Times News Dataset☆20Updated 8 years ago
- A framework to identify relations between ideas in temporal text corpora.☆28Updated 7 years ago
- Software for the paper "Gender and Lexical Variation in Social Media" with David Bamman and Tyler Schnoebelen☆17Updated 9 years ago
- The COVID-19 Real World Worry Datasets☆27Updated 3 years ago
- public repository of the interdisciplinary working group 'Hatespeech' of the research training group UCSM☆17Updated 6 years ago
- Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"☆31Updated 4 years ago
- Dynamic Word Embeddings for Evolving Semantic Discovery code.☆73Updated 2 years ago
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020.☆18Updated 5 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆57Updated 10 months ago
- Harassment Lexicon and Corpus☆30Updated 7 years ago
- annotated hateful speech☆24Updated 6 years ago
- ☆15Updated 7 years ago
- Metaphor detection using NLP techniques, made in Python using NLTK☆18Updated 11 years ago
- Training Temporal Word Embeddings with a Compass☆65Updated last month
- ☆11Updated 5 years ago
- Datasets for fake news and misinformation detection☆68Updated 2 years ago
- BirdSpotter is a python package which provides an influence and bot detection toolkit for twitter.☆19Updated 4 years ago
- Tokenizer for Twitter and Reddit data☆46Updated 6 years ago
- ☆54Updated 3 years ago
- Code to reproduce experiments from the ACL 2016 paper about Rumour Stance Classification with Hawkes Processes.☆25Updated 8 years ago