pmbaumgartner / text-feat-lib
Provide a comprehensive list of tokenizers, features, and general NLP things used for text analysis with examples. The initial focus is on features used for twitter data and sentiment analysis.
☆46Updated 9 years ago
Alternatives and similar repositories for text-feat-lib:
Users that are interested in text-feat-lib are comparing it to the libraries listed below
- Subjectivity and sentiment classification using polarity lexicons☆88Updated 3 years ago
- Automatic labeling for topic model☆57Updated 9 years ago
- Wrapper to use syntaxnet with pre-trained model☆29Updated 6 years ago
- Similarity search on Wikipedia using gensim in Python.☆60Updated 6 years ago
- Topic Modelling for Humans☆40Updated 7 years ago
- ☆40Updated 9 years ago
- A Dependency Parser for Tweets☆78Updated 5 years ago
- ☆21Updated 8 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆152Updated 4 months ago
- ☆23Updated 7 years ago
- See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse☆152Updated 4 years ago
- Benchmarking text classification algorithms☆64Updated 7 years ago
- A python module to get the emotion of a word.☆75Updated 6 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 7 years ago
- A small utility for converting Stanford GloVe vectors to HDF5 / NumPy☆12Updated 7 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆84Updated 8 months ago
- Python implementation of MABED (Mention-Anomaly-Based Event Detection)☆37Updated 5 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆141Updated 6 years ago
- materials for the study on mental health subreddits. If you use this code in your work, please cite George Gkotsis, Anika Oellrich, Tim …☆21Updated last year
- Train word embeddings with Gensim and vizualize them with TensorBoard☆34Updated 6 years ago
- An introduction to using spaCy for NLP and machine learning☆191Updated 3 years ago
- A curated list of resources dedicated to text summarization☆55Updated 6 years ago
- My solution for the Kaggle "Allen AI science challenge"☆48Updated 9 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training☆18Updated 6 years ago
- Tokenization and pre-processing for Twitter data used to train classifiers.☆72Updated 8 years ago
- HackDelft☆81Updated 7 years ago
- Sentiment Classification using Word Sense Disambiguation☆171Updated 2 years ago
- Getting started with AllenNLP and PyTorch by training a tweet classifier☆66Updated 7 years ago