matchado / HashTagSplitter
A Python function to break down hashtags or compound words created by putting together multiple words
☆33Updated 9 years ago
Alternatives and similar repositories for HashTagSplitter:
Users that are interested in HashTagSplitter are comparing it to the libraries listed below
- Twitter word embeddings generated using Word2Vec and FastText.☆49Updated 5 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆138Updated 2 years ago
- Analysis and experiments on the UN General Debate corpus☆36Updated 5 years ago
- Word2Vec 400M Tweets Embedding model based on https://www.fredericgodin.com/software/☆42Updated 4 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆141Updated 6 years ago
- See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse☆152Updated 4 years ago
- ☆56Updated 3 years ago
- A multilingual lexicon of words to hurt.☆83Updated 3 months ago
- ☆26Updated 8 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆269Updated last year
- Code and data for inducing domain-specific sentiment lexicons.☆195Updated 7 months ago
- Python library for advanced text mining☆68Updated 4 years ago
- Replication code for the JAIR 2016 paper "Predicting Twitter User Demographics using Distance Supervision from Website Traffic Data"☆12Updated 8 years ago
- ☆53Updated 2 years ago
- Package for Statistically significant linguistic change☆55Updated 2 years ago
- A library for topic modeling and browsing☆89Updated 6 years ago
- The page lists recent research developments in the area of Stance Learning.☆53Updated last year
- An introduction to using spaCy for NLP and machine learning☆191Updated 3 years ago
- Geolocation for Twitter.☆72Updated 2 years ago
- Python implementation of MABED (Mention-Anomaly-Based Event Detection)☆37Updated 5 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- CrisisLex: Your data and lexical resource in crises☆51Updated last year
- A baseline implementation for FNC-1☆137Updated 2 years ago
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania☆19Updated 3 years ago
- Tutorial on computational models of language change☆114Updated 5 years ago
- Hierarchical, multi-label topic modelling with LDA☆53Updated 2 years ago
- A Dependency Parser for Tweets☆78Updated 5 years ago
- Generating labels for topics automatically using neural embeddings☆183Updated last year
- Key information extraction from text and graph visualization☆91Updated 4 years ago
- Data to accompany the ICWSM 2015 paper "CREDBANK: A Large-scale Social Media Corpus With Associated Credibility Annotations"☆45Updated 5 years ago