matchado / HashTagSplitter
A Python function that splits hashtags and other compound words into the individual words they were built from.
☆33 Updated 10 years ago
Alternatives and similar repositories for HashTagSplitter
Users who are interested in HashTagSplitter are comparing it to the libraries listed below.
- Elegant and Easy Tweet Preprocessing in Python ☆310 Updated 2 years ago
- Twitter word embeddings generated using Word2Vec and FastText. ☆47 Updated 6 years ago
- Default English stopword lists from many different sources ☆310 Updated 2 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆672 Updated 6 months ago
- Collection of tools for building diachronic/historical word vectors ☆443 Updated last year
- Tutorial on topic models in Python with scikit-learn ☆158 Updated 2 years ago
- ☆234 Updated 8 years ago
- Pretrained BERT model for analysing COVID-19 Twitter data ☆184 Updated 2 years ago
- analyze text with empath ☆338 Updated 8 years ago
- Key information extraction from text and graph visualization ☆91 Updated 5 years ago
- Cleans Reddit Text Data ☆84 Updated 5 years ago
- Code and data for inducing domain-specific sentiment lexicons. ☆196 Updated last year
- Word Embeddings for Information Retrieval ☆225 Updated 2 years ago
- ☆171 Updated 2 years ago
- Tutorial on computational models of language change ☆116 Updated 6 years ago
- Deep Learning models to detect hate speech in tweets ☆217 Updated 7 years ago
- Linguistic Inquiry and Word Count (LIWC) analyzer ☆231 Updated 3 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆191 Updated 2 years ago
- 16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis. ☆227 Updated 6 years ago
- A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset. ☆172 Updated 7 months ago
- Remove problematic gender bias from word embeddings. ☆251 Updated 2 years ago
- Quickly extract multi-word phrases from a corpus ☆194 Updated 5 years ago
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings" ☆70 Updated 3 years ago
- Interpretable data visualizations for understanding how texts differ at the word level ☆284 Updated 9 months ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how? ☆527 Updated last year
- ☆301 Updated 8 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html ☆140 Updated 3 years ago
- A multilingual lexicon of words to hurt. ☆92 Updated last month
- ☆62 Updated 4 years ago
- A library for topic modeling and browsing ☆89 Updated 6 years ago