MoritzLaurer / less-annotating-with-bert-nli
Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning with Deep Transfer Learning and BERT-NLI"
☆23Updated 6 months ago
Related projects: ⓘ
- The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".☆16Updated last year
- The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral …☆43Updated 3 months ago
- Learning from Neighbors: Unsupervised Text Classification☆17Updated last year
- MoodCat😼 classifies the mood of English sentences.☆13Updated 2 years ago
- This is a step by step tutorial for text analyst who want an easy start to basic and and common techniques in NLP, Text Analysis, Machine…☆20Updated last year
- HDBSCAN Tuning for BERTopic Models☆42Updated last year
- Dutch abusive language data☆11Updated 11 months ago
- A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)☆26Updated 8 months ago
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆26Updated last week
- The Harvard USPTO Patent Dataset☆54Updated 9 months ago
- Code and data for paper "Large language models can rate news outlet credibility"☆12Updated last month
- Fine-tuned transformers for protest event detection.☆9Updated 3 years ago
- Code for publication Törnberg, P. 2022. "How digital media drive affective polarization through partisan sorting". PNAS.☆14Updated 2 years ago
- Tutorials for Stance Detection: A practical guide☆21Updated last year
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated 9 months ago
- A BERT-based application for reusable text classification at scale☆37Updated last year
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆84Updated last year
- ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence (NAACL 2022)☆16Updated 11 months ago
- Noise-robust de-duplication at scale☆15Updated last year
- Package to extract connotation frames☆78Updated 9 months ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆16Updated 2 months ago
- Repository for the paper Us vs. Them: A Dataset of Populist Attitudes, News Bias and Emotions☆16Updated 3 months ago
- A python package to enrich Twitter Data☆73Updated last year
- Twitter dataset for 2022 Russian and Ukrainian crisis☆50Updated last year
- ☆22Updated last year
- Text-Based Ideal Points☆43Updated last year
- ☆53Updated 8 months ago
- ☆26Updated last year
- Classifies sentences whether they represent a fact or personal opinion with 90% accuracy using various Machine Learning algorithms from s…☆28Updated 6 years ago