fhamborg / NewsMTSC
Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.
☆140Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for NewsMTSC
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆512Updated 3 weeks ago
- Cleans Reddit Text Data☆81Updated 4 years ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆56Updated 9 months ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆70Updated 11 months ago
- A classifier that distinguishes political from non-political news articles.☆28Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆92Updated last year
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆313Updated 3 months ago
- This repository provides usage examples for the Python module Newspaper3k.☆142Updated 10 months ago
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆91Updated last year
- A collection of topic diversity measures for topic modeling☆45Updated 3 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆254Updated last week
- Implementation of the ClausIE information extraction system for python+spacy☆220Updated 2 years ago
- Text analysis with networks.☆285Updated 6 months ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆107Updated last year
- A multilingual lexicon of words to hurt.☆80Updated 2 weeks ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- HDBSCAN Tuning for BERTopic Models☆42Updated last year
- LexRank algorithm for text summarization☆229Updated 7 months ago
- ☆147Updated 5 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆115Updated 7 months ago
- A python package for text preprocessing task in natural language processing.☆63Updated 2 years ago
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document …☆177Updated 9 months ago
- This repository contains a dataset for hate speech detection on social media platforms.☆66Updated last year
- ☆35Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆191Updated last year
- A Python library for calculating a large variety of metrics from text☆315Updated last month
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe…☆114Updated 4 years ago
- Class for Aspect-term extraction and Aspect-based sentiment analysis with BERT and Adapters☆40Updated 2 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆361Updated 2 months ago