unitaryai / detoxifyView external linksLinks
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using β‘ Pytorch Lightning and π€ Transformers. For access to our API, please email us at contact@unitary.ai.
β1,182Jan 5, 2026Updated last month
Alternatives and similar repositories for detoxify
Users that are interested in detoxify are comparing it to the libraries listed below
Sorting:
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.β346Jun 17, 2024Updated last year
- Using GPT-3 to detect hate speech that contains sexist and racist contentβ24Nov 11, 2025Updated 3 months ago
- TextAttack π is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocsβ¦β3,358Jul 10, 2025Updated 7 months ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.β110Jun 12, 2023Updated 2 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.β230Jun 12, 2023Updated 2 years ago
- This repo contains the dataset and description for Ruddit and its variants.β36Feb 13, 2022Updated 4 years ago
- Data augmentation for NLPβ4,645Jun 24, 2024Updated last year
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.β7,397Jan 31, 2026Updated 2 weeks ago
- This repository contains a dataset for hate speech detection on social media platforms.β74Dec 9, 2022Updated 3 years ago
- State-of-the-Art Text Embeddingsβ18,225Updated this week
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017β834Jun 12, 2023Updated 2 years ago
- Active Learning for Text Classification in Pythonβ639Feb 1, 2026Updated last week
- β57Jun 5, 2024Updated last year
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic β¦β3,632Feb 6, 2026Updated last week
- Datasets for Hate Speech Detectionβ136May 12, 2023Updated 2 years ago
- NL-Augmenter π¦ β π A Collaborative Repository of Natural Language Transformationsβ786May 19, 2024Updated last year
- RΓΆttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models"β17May 23, 2022Updated 3 years ago
- RΓΆttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Dataβ59Oct 14, 2025Updated 4 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasetsβ4,852Updated this week
- Catalog of abusive language data (PLoS 2020)β321Jun 14, 2024Updated last year
- jiant is an nlp toolkitβ1,675Jul 6, 2023Updated 2 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining theβ¦β2,079Aug 15, 2024Updated last year
- Efficient few-shot learning with Sentence Transformersβ2,678Dec 11, 2025Updated 2 months ago
- skweak: A software toolkit for weak supervision applied to NLP tasksβ926Sep 2, 2024Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deploymentβ791Apr 24, 2023Updated 2 years ago
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language modelsβ3,199Jul 19, 2024Updated last year
- Adversarial Natural Language Inference Benchmarkβ398May 12, 2022Updated 3 years ago
- β228Feb 23, 2021Updated 4 years ago
- Library for Knowledge Intensive Language Tasksβ963Mar 31, 2022Updated 3 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for π€ Hugging Face transformer models πβ1,689Oct 23, 2024Updated last year
- Detect toxic spans in toxic textsβ71Jun 12, 2023Updated 2 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)β14,355Oct 27, 2025Updated 3 months ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"β1,814Jun 17, 2025Updated 7 months ago
- A multilingual lexicon of words to hurt.β94Oct 10, 2025Updated 4 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,885Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,143Sep 30, 2025Updated 4 months ago
- β332Updated this week
- A python package for benchmarking interpretability techniques on Transformers.β215Sep 29, 2024Updated last year
- Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domainsβ1,736Oct 8, 2023Updated 2 years ago