tommasoc80 / DALC
Dutch abusive language data
☆11Updated last year
Alternatives and similar repositories for DALC:
Users that are interested in DALC are comparing it to the libraries listed below
- ☆22Updated 2 years ago
- Using short models to classify long texts☆21Updated last year
- ☆21Updated 2 weeks ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last month
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- ☆22Updated 2 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Updated 3 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆13Updated last year
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆12Updated 7 months ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆12Updated last year
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- NLP tool to extract emotional phrase from tweets 🤩☆40Updated 3 years ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆30Updated 10 months ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- ☆16Updated 6 months ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆70Updated 5 months ago
- Arabic News Stance Corpus☆10Updated 4 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆22Updated last month
- A set of methods for finding an appropriate number of topics in a text collection☆15Updated 6 months ago
- ☆15Updated 4 years ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆13Updated last year
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Exploring NLP weak supervision approaches to train text classification models. The project is also a prototype for a semi-automated text …☆22Updated 11 months ago
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 4 years ago
- ☆35Updated last year