cardiffnlp / tweetnlp
TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/understand tweets such as sentiment analysis, emoji prediction, and named entity recognition, powered by state-of-the-art language models specialised on Twitter.
☆304Updated last month
Related projects: ⓘ
- Repository for TweetEval☆354Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆251Updated 4 months ago
- Clustering sentence embeddings to extract message intent☆166Updated 2 years ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆714Updated last month
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆137Updated 8 months ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆574Updated last month
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆65Updated 9 months ago
- Concept Modeling: Topic Modeling on Images and Text☆192Updated last year
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆85Updated last year
- Text analysis with networks.☆283Updated 4 months ago
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document …☆175Updated 7 months ago
- A Python library for calculating a large variety of metrics from text☆309Updated this week
- 📖 A curated list of LegalNLP resources from all around the web.☆227Updated last year
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,192Updated 8 months ago
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆113Updated 3 months ago
- Datasets for Hate Speech Detection☆114Updated last year
- SpanMarker for Named Entity Recognition☆384Updated last month
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆321Updated last year
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆52Updated 7 months ago
- Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).☆392Updated 3 weeks ago
- Catalog of abusive language data (PLoS 2020)☆299Updated 3 months ago
- Code & Prompts for TopicGPT: A Prompt-Based Framework for Topic Modeling☆202Updated 5 months ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆373Updated last year
- Entity Disambiguation as text extraction (ACL 2022)☆173Updated 2 years ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆180Updated 3 weeks ago
- Zero and Few shot named entity & relationships recognition☆340Updated this week
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆80Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆114Updated 5 months ago
- HDBSCAN Tuning for BERTopic Models☆42Updated last year
- A collection of topic diversity measures for topic modeling