s / preprocessor
Elegant and Easy Tweet Preprocessing in Python
☆306 Updated last year
Alternatives and similar repositories for preprocessor:
Users interested in preprocessor are comparing it to the libraries listed below.
- 16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis. ☆224 Updated 5 years ago
- semi supervised guided topic model with custom guidedLDA ☆504 Updated 4 years ago
- Steam review texting embedding analysis ☆141 Updated last year
- Tutorial on topic models in Python with scikit-learn ☆157 Updated last year
- Lexicon-based sentiment analysis inspired by Syuzhet R package ☆22 Updated 2 years ago
- Word2Vec 400M Tweets Embedding model based on https://www.fredericgodin.com/software/ ☆42 Updated 4 years ago
- A multilingual lexicon of words to hurt. ☆86 Updated 4 months ago
- Models for predicting emotions from English tweets. ☆164 Updated last year
- Harry Potter and the Allocation of Dirichlet ☆123 Updated 5 years ago
- Pretrained BERT model for analysing COVID-19 Twitter data ☆184 Updated last year
- Open source Emoticons and Emoji detection library: emot ☆192 Updated last year
- 🔤 Calculate average word embeddings (word2vec) from documents for transfer learning ☆54 Updated 10 months ago
- Code and data for inducing domain-specific sentiment lexicons. ☆195 Updated 7 months ago
- A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset. ☆169 Updated 6 months ago
- Fixes contractions such as `you're` to `you are` ☆315 Updated 2 years ago
- Analysis and experiments on the UN General Debate corpus ☆36 Updated 5 years ago
- Applying NLP transfer learning techniques to predict Tweet stance toward a topic ☆107 Updated 6 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆190 Updated last year
- Repo for my talk at the PyData Berlin 2017 conference ☆66 Updated 7 years ago
- ☆233 Updated 8 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆109 Updated last year
- ☆54 Updated 2 years ago
- A baseline implementation for FNC-1 ☆137 Updated 2 years ago
- The SentiWordNet sentiment lexicon ☆326 Updated 2 years ago
- GSDMM: Short text clustering ☆355 Updated 2 years ago
- Repository for TweetEval ☆365 Updated 2 years ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆666 Updated last year
- Word Embeddings for Information Retrieval ☆225 Updated last year
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆76 Updated 3 years ago
- Biterm Topic Model ☆135 Updated last year