matchado / HashTagSplitterLinks
A Python function to break down hashtags or compound words created by putting together multiple words
☆33Updated 10 years ago
Alternatives and similar repositories for HashTagSplitter
Users that are interested in HashTagSplitter are comparing it to the libraries listed below
Sorting:
- Elegant and Easy Tweet Preprocessing in Python☆309Updated 2 years ago
- Twitter word embeddings generated using Word2Vec and FastText.☆47Updated 6 years ago
- Default English stopword lists from many different sources☆311Updated 2 years ago
- A dataset of millions of news articles scraped from a curated list of data sources.☆406Updated 5 years ago
- ☆234Updated 9 years ago
- 16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis.☆228Updated 6 years ago
- Cleans Reddit Text Data☆83Updated 5 years ago
- Code and data for inducing domain-specific sentiment lexicons.☆196Updated last year
- Tutorial on topic models in Python with scikit-learn☆157Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆528Updated last year
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati…☆674Updated 7 months ago
- Word Embeddings for Information Retrieval☆225Updated 2 years ago
- Data and analysis for the BuzzFeed News article, "Hyperpartisan Facebook Pages Are Publishing False And Misleading Information At An Alar…☆112Updated 9 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆140Updated 3 years ago
- analyze text with empath☆339Updated 8 years ago
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings"☆70Updated 3 years ago
- Quickly extract multi-word phrases from a corpus☆194Updated 5 years ago
- Analysis and experiments on the UN General Debate corpus☆36Updated 6 years ago
- A multilingual lexicon of words to hurt.☆92Updated 2 months ago
- Collection of tools for building diachronic/historical word vectors☆443Updated 2 years ago
- Data and analysis supporting the BuzzFeed News article, "In Spite Of Its Efforts, Facebook Is Still The Home Of Hugely Viral Fake News" p…☆33Updated 3 years ago
- Datasets for fake news and misinformation detection☆69Updated 2 years ago
- Models for predicting emotions from English tweets.☆165Updated 2 years ago
- Subjectivity and sentiment classification using polarity lexicons☆91Updated 4 years ago
- ☆301Updated 8 years ago
- ☆40Updated 10 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 7 years ago
- This repository contains papers and resources pertaining to Hate speech research.☆44Updated 4 years ago
- Remove problematic gender bias from word embeddings.☆251Updated 2 years ago
- Data to accompany the ICWSM 2015 paper "CREDBANK: A Large-scale Social Media Corpus With Associated Credibility Annotations"☆46Updated 6 years ago