dominiksinsaarland / DocSCAN
Learning from Neighbors: Unsupervised Text Classification
☆17Updated 2 years ago
Alternatives and similar repositories for DocSCAN:
Users that are interested in DocSCAN are comparing it to the libraries listed below
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆31Updated 11 months ago
- ☆41Updated 4 years ago
- Repository for the paper Us vs. Them: A Dataset of Populist Attitudes, News Bias and Emotions☆16Updated 8 months ago
- Package to extract connotation frames☆83Updated last year
- Dataset and code for directed sentiment analysis in news text.☆16Updated 3 years ago
- Repository for the CommonLit Ease of Readability Corpus☆22Updated 10 months ago
- ☆22Updated 4 years ago
- ☆21Updated last year
- Code and data for paper "Large language models can rate news outlet credibility"☆12Updated 6 months ago
- Twitter dataset for 2022 Russian and Ukrainian crisis☆49Updated 2 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- ☆32Updated last year
- HDBSCAN Tuning for BERTopic Models☆43Updated last year
- Noise-robust de-duplication at scale☆17Updated last year
- ☆13Updated 3 years ago
- ☆39Updated 3 years ago
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆27Updated 5 months ago
- A python package to enrich Twitter Data☆74Updated last year
- Package for computing causal effects of text (as treatment)☆70Updated 2 years ago
- ☆164Updated 2 years ago
- Text-Based Ideal Points☆43Updated last year
- The COVID-19 Real World Worry Datasets☆27Updated 3 years ago
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020.☆18Updated 4 years ago
- A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)☆27Updated last year
- Driver for LIWC2015 analysis. LIWC2015 dictionary not included.☆16Updated 2 years ago
- ☆17Updated 6 years ago
- ☆16Updated 3 weeks ago
- Information and data related to the ProtestNews shared task at CASE @ ACL-IJCNLP 2021 workshop☆43Updated 2 years ago
- Scripts to fit and explore word embedding models augmented with political metadata.☆23Updated 7 months ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆104Updated last year