Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ PyTorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
⭐ 1,262 · Apr 6, 2026 · Updated 2 months ago
Alternatives and similar repositories for detoxify
Users that are interested in detoxify are comparing it to the libraries listed below.
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022. ⭐ 348 · Jun 17, 2024 · Updated 2 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question. ⭐ 245 · Jun 12, 2023 · Updated 3 years ago
- Developing a classification model to detect hate tweets ready for deployment using various NLP techniques ⭐ 19 · Oct 7, 2024 · Updated last year
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ⭐ 112 · Jun 12, 2023 · Updated 3 years ago
- Using GPT-3 to detect hate speech that contains sexist and racist content ⭐ 24 · Nov 11, 2025 · Updated 7 months ago
- ⭐ 60 · Jun 5, 2024 · Updated 2 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs… ⭐ 3,437 · Apr 17, 2026 · Updated 2 months ago
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017 ⭐ 844 · Jun 12, 2023 · Updated 3 years ago
- This repo contains the dataset and description for Ruddit and its variants. ⭐ 36 · Feb 13, 2022 · Updated 4 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data ⭐ 59 · Oct 14, 2025 · Updated 8 months ago
- ⭐ 233 · Feb 23, 2021 · Updated 5 years ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics. ⭐ 7,716 · May 13, 2026 · Updated last month
- Catalog of abusive language data (PLoS 2020) ⭐ 324 · Jun 14, 2024 · Updated 2 years ago
- Datasets for Hate Speech Detection ⭐ 138 · May 12, 2023 · Updated 3 years ago
- This repository contains a dataset for hate speech detection on social media platforms. ⭐ 75 · Dec 9, 2022 · Updated 3 years ago
- Data augmentation for NLP ⭐ 4,659 · Jun 20, 2026 · Updated 2 weeks ago
- The world's largest social media toxicity dataset. ⭐ 192 · Jun 10, 2022 · Updated 4 years ago
- State-of-the-Art Embeddings, Retrieval, and Reranking ⭐ 18,853 · Jun 26, 2026 · Updated last week
- ⭐ 469 · May 30, 2023 · Updated 3 years ago
- A multilingual lexicon of words to hurt. ⭐ 99 · Oct 10, 2025 · Updated 8 months ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic … ⭐ 3,655 · Jun 22, 2026 · Updated last week
- ⭐ 10 · Aug 31, 2022 · Updated 3 years ago
- Code associated with ACL 2021 DExperts paper ⭐ 119 · May 24, 2023 · Updated 3 years ago
- Efficient few-shot learning with Sentence Transformers ⭐ 2,755 · May 26, 2026 · Updated last month
- Detect toxic spans in toxic texts ⭐ 70 · Jun 12, 2023 · Updated 3 years ago
- jiant is an NLP toolkit ⭐ 1,676 · Jul 6, 2023 · Updated 2 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations ⭐ 786 · May 19, 2024 · Updated 2 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 ⭐ 1,689 · Oct 23, 2024 · Updated last year
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ⭐ 5,022 · Updated this week
- Active Learning for Text Classification in Python ⭐ 645 · May 24, 2026 · Updated last month
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ⭐ 1,845 · Jun 17, 2025 · Updated last year
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models ⭐ 3,248 · Jul 19, 2024 · Updated last year
- ⭐ 160 · Aug 9, 2022 · Updated 3 years ago
- ⭐ 328 · Feb 25, 2026 · Updated 4 months ago
- Adversarial Natural Language Inference Benchmark ⭐ 400 · May 12, 2022 · Updated 4 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so… ⭐ 17 · Jul 27, 2023 · Updated 2 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ⭐ 14,379 · Oct 27, 2025 · Updated 8 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ⭐ 3,138 · May 26, 2026 · Updated last month
- Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime ⭐ 55 · Sep 19, 2023 · Updated 2 years ago