matchado / HashTagSplitterLinks
A Python function to break down hashtags or compound words created by putting together multiple words
☆34Updated 10 years ago
Alternatives and similar repositories for HashTagSplitter
Users that are interested in HashTagSplitter are comparing it to the libraries listed below
Sorting:
- Elegant and Easy Tweet Preprocessing in Python☆310Updated 2 years ago
- Default English stopword lists from many different sources☆308Updated 2 years ago
- Cleans Reddit Text Data☆82Updated 5 years ago
- Twitter word embeddings generated using Word2Vec and FastText.☆47Updated 6 years ago
- A dataset of millions of news articles scraped from a curated list of data sources.☆398Updated 5 years ago
- Key information extraction from text and graph visualization☆91Updated 5 years ago
- ☆235Updated 8 years ago
- A multilingual lexicon of words to hurt.☆90Updated last month
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆140Updated 3 years ago
- Code and data for inducing domain-specific sentiment lexicons.☆196Updated last year
- A library for topic modeling and browsing☆89Updated 6 years ago
- Pretrained BERT model for analysing COVID-19 Twitter data☆184Updated 2 years ago
- 16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis.☆227Updated 6 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆671Updated 3 months ago
- Collection of tools for building diachronic/historical word vectors☆440Updated last year
- ☆301Updated 8 years ago
- Tutorial on topic models in Python with scikit-learn☆157Updated last year
- Word Embeddings for Information Retrieval☆225Updated last year
- Interpretable data visualizations for understanding how texts differ at the word level☆280Updated 6 months ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated 2 years ago
- ☆54Updated 3 years ago
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings"☆69Updated 2 years ago
- a Deep Learning Framework for Text https://delft.readthedocs.io/☆401Updated last month
- Steam review texting embedding analysis☆142Updated 2 years ago
- Quickly extract multi-word phrases from a corpus☆194Updated 5 years ago
- Tutorial on computational models of language change☆116Updated 6 years ago
- Catalog of abusive language data (PLoS 2020)☆314Updated last year
- Linguistic Inquiry and Word Count (LIWC) analyzer☆222Updated 3 years ago
- A python package for the Linguistic Inquiry and Word Count (LIWC) dictionary.☆40Updated 4 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated 2 years ago