matchado / HashTagSplitterLinks
A Python function to break down hashtags or compound words created by putting together multiple words
☆33Updated 10 years ago
Alternatives and similar repositories for HashTagSplitter
Users that are interested in HashTagSplitter are comparing it to the libraries listed below
Sorting:
- Elegant and Easy Tweet Preprocessing in Python☆310Updated 2 years ago
- Default English stopword lists from many different sources☆308Updated 2 years ago
- semi supervised guided topic model with custom guidedLDA☆510Updated 5 months ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆671Updated 4 months ago
- analyze text with empath☆336Updated 8 years ago
- ☆235Updated 8 years ago
- Collection of tools for building diachronic/historical word vectors☆442Updated last year
- A dataset of millions of news articles scraped from a curated list of data sources.☆400Updated 5 years ago
- Tutorial on topic models in Python with scikit-learn☆157Updated 2 years ago
- Cleans Reddit Text Data☆83Updated 5 years ago
- Deep Learning models to detect hate speech in tweets☆217Updated 7 years ago
- See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse☆150Updated 5 years ago
- Datasets for fake news and misinformation detection☆68Updated 2 years ago
- Pretrained BERT model for analysing COVID-19 Twitter data☆184Updated 2 years ago
- Various Algorithms for Short Text Mining☆472Updated last week
- An introduction to using spaCy for NLP and machine learning☆192Updated 3 years ago
- ☆301Updated 8 years ago
- Twitter word embeddings generated using Word2Vec and FastText.☆47Updated 6 years ago
- Remove problematic gender bias from word embeddings.☆250Updated 2 years ago
- ☆71Updated 7 years ago
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis☆591Updated 2 years ago
- Data and analysis for the BuzzFeed News article, "Hyperpartisan Facebook Pages Are Publishing False And Misleading Information At An Alar…☆111Updated 8 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆280Updated 7 months ago
- Code for the paper "Characterizing and Detecting Hateful Users on Twitter"☆74Updated 4 years ago
- 16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis.☆227Updated 6 years ago
- Semantic Orientation Calculator for Sentiment Analysis☆51Updated 2 years ago
- A python package for the Linguistic Inquiry and Word Count (LIWC) dictionary.☆40Updated 4 years ago
- Linguistic Inquiry and Word Count (LIWC) analyzer☆222Updated 3 years ago
- A multilingual lexicon of words to hurt.☆90Updated 2 months ago
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx☆639Updated 4 years ago