cardiffnlp / tweetnlpLinks
TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/understand tweets such as sentiment analysis, emoji prediction, and named entity recognition, powered by state-of-the-art language models specialised on Twitter.
☆360Updated 6 months ago
Alternatives and similar repositories for tweetnlp
Users that are interested in tweetnlp are comparing it to the libraries listed below
Sorting:
- Repository for TweetEval☆386Updated 3 years ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆155Updated 3 months ago
- A Python library for calculating a large variety of metrics from text☆352Updated 10 months ago
- Concept Modeling: Topic Modeling on Images and Text☆214Updated 11 months ago
- Clustering sentence embeddings to extract message intent☆174Updated 4 years ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆789Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆265Updated 11 months ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆82Updated last year
- Text analysis with networks.☆290Updated 2 weeks ago
- A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks☆627Updated last year
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆96Updated 2 years ago
- 📖 A curated list of LegalNLP resources from all around the web.☆288Updated 2 weeks ago
- Catalog of abusive language data (PLoS 2020)☆317Updated last year
- A multilingual lexicon of words to hurt.☆91Updated 3 weeks ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆223Updated 2 years ago
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆145Updated last year
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆600Updated last year
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document …☆186Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆189Updated 5 months ago
- A python package for text preprocessing task in natural language processing.☆63Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Updated 9 months ago
- potato: portable text annotation tool☆353Updated last week
- ☆169Updated last year
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆175Updated 5 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆111Updated 2 years ago
- Repository for the LREC 2022 submission on Emotion Word Dynamics in Geolocated Tweet data.☆101Updated 2 years ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆64Updated 8 months ago
- HDBSCAN Tuning for BERTopic Models☆49Updated 2 years ago
- Datasets for Hate Speech Detection☆132Updated 2 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆391Updated last year