matchado / HashTagSplitter
A Python function to break down hashtags or compound words created by putting together multiple words
☆33Updated 9 years ago
Alternatives and similar repositories for HashTagSplitter:
Users that are interested in HashTagSplitter are comparing it to the libraries listed below
- Twitter word embeddings generated using Word2Vec and FastText.☆49Updated 5 years ago
- Tutorial on topic models in Python with scikit-learn☆157Updated last year
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆138Updated 2 years ago
- Code and data for inducing domain-specific sentiment lexicons.☆195Updated 8 months ago
- Automatic labeling for topic model☆57Updated 9 years ago
- Tutorial on computational models of language change☆114Updated 5 years ago
- Analysis and experiments on the UN General Debate corpus☆36Updated 5 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago
- Subjectivity and sentiment classification using polarity lexicons☆88Updated 3 years ago
- Making sense embedding out of word embeddings using graph-based word sense induction☆213Updated 3 years ago
- Generating labels for topics automatically using neural embeddings☆185Updated 3 weeks ago
- ☆26Updated 8 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- Python port of the Twokenize class of ark-tweet-nlp☆141Updated 6 years ago
- Python library for advanced text mining☆68Updated 4 years ago
- Package for Statistically significant linguistic change☆55Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated last year
- ☆54Updated 3 years ago
- ☆54Updated 3 years ago
- Computation of the semantic interpretability of topics produced by topic models.☆180Updated 7 years ago
- See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse☆152Updated 4 years ago
- A Dependency Parser for Tweets☆78Updated 5 years ago
- An Easy to Use, Accurate Python Geolocation Library☆41Updated 2 years ago
- Cleans Reddit Text Data☆81Updated 4 years ago
- Default English stopword lists from many different sources☆298Updated last year
- Data and analysis for the BuzzFeed News article, "Hyperpartisan Facebook Pages Are Publishing False And Misleading Information At An Alar…☆110Updated 8 years ago
- Python interface for https://github.com/dice-group/Palmetto☆39Updated 2 years ago
- spaCy + UDPipe☆161Updated 2 years ago
- Worked examples from the NLTK Book☆182Updated 5 years ago
- ☆40Updated 9 years ago