dominiksinsaarland / DocSCAN
Learning from Neighbors: Unsupervised Text Classification
☆17Updated 2 years ago
Alternatives and similar repositories for DocSCAN:
Users that are interested in DocSCAN are comparing it to the libraries listed below
- ☆41Updated 5 years ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆31Updated last year
- Repository for the CommonLit Ease of Readability Corpus☆23Updated last year
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆28Updated 8 months ago
- Repository for the paper Us vs. Them: A Dataset of Populist Attitudes, News Bias and Emotions☆16Updated 11 months ago
- Package to extract connotation frames☆85Updated last year
- ☆21Updated last year
- Additional material for the paper "MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations Prediction"☆54Updated 2 years ago
- Noise-robust de-duplication at scale☆19Updated 2 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆12Updated 10 months ago
- A python package to enrich Twitter Data☆75Updated last year
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- Code for the paper "Content Analysis of Textbooks via Natural Language Processing".☆58Updated last year
- Dataset and code for directed sentiment analysis in news text.☆16Updated 3 years ago
- Text-Based Ideal Points☆44Updated 2 years ago
- Sentiment Analysis and Cognition Engine (text analysis tool)☆19Updated 4 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆106Updated last year
- Literature 📄 and datasets 📚 on automatic populism detection☆18Updated last month
- This is a step by step tutorial for text analyst who want an easy start to basic and and common techniques in NLP, Text Analysis, Machine…☆18Updated 2 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆16Updated 3 years ago
- Scripts to fit and explore word embedding models augmented with political metadata.☆25Updated 9 months ago
- Code and data for paper "Large language models can rate news outlet credibility"☆13Updated 9 months ago
- Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpo…☆45Updated last month
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020.☆18Updated 4 years ago
- ☆19Updated 3 years ago
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- ☆54Updated 3 years ago
- ☆15Updated 7 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- Package for computing causal effects of text (as treatment)☆70Updated 3 years ago