tommasoc80 / DALC
Dutch abusive language data
☆11Updated last year
Alternatives and similar repositories for DALC
Users that are interested in DALC are comparing it to the libraries listed below
Sorting:
- Using short models to classify long texts☆21Updated 2 years ago
- ☆22Updated 3 years ago
- ☆22Updated 3 months ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 4 months ago
- MinHash implementation in Python☆11Updated 8 months ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 2 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Updated 3 years ago
- ☆23Updated 2 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- ☆54Updated last year
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Just another sentiment wrapper.☆17Updated 3 years ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 2 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆24Updated 2 months ago
- Transforming textual descriptions into process models using deep learning☆14Updated 5 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆24Updated 5 months ago
- ☆38Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated this week
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 4 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ☆14Updated 7 months ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 5 years ago
- Semantically Structured Sentence Embeddings☆66Updated 6 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated 2 years ago
- This repository hosts the code for a tokenizer of tweets.☆12Updated 6 years ago