unitaryai / detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ PyTorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
☆1,042 Updated last month
Alternatives and similar repositories for detoxify
Users who are interested in detoxify compare it to the libraries listed below.
- Catalog of abusive language data (PLoS 2020) ☆310 Updated 11 months ago
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022. ☆317 Updated 10 months ago
- Repository for TweetEval ☆373 Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020) ☆590 Updated 9 months ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und… ☆346 Updated last month
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations ☆783 Updated 11 months ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question. ☆206 Updated last year
- The implementation of DeBERTa ☆2,082 Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆729 Updated last year
- Hate speech dataset from Stormfront forum manually labelled at sentence level. ☆173 Updated 4 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ☆1,228 Updated 3 months ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code. ☆1,341 Updated last year
- A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/ca… ☆486 Updated last year
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference" ☆1,629 Updated last year
- Robustness Gym is an evaluation toolkit for machine learning. ☆440 Updated 2 years ago
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017 ☆814 Updated last year
- TextAugment: Text Augmentation Library ☆420 Updated last year
- simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models. ☆393 Updated last year
- Active Learning for Text Classification in Python ☆614 Updated last month
- Top2Vec learns jointly embedded topic, document and word vectors. ☆3,036 Updated 6 months ago
- ☆1,209 Updated 9 months ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the… ☆2,032 Updated 9 months ago
- A multilingual lexicon of words to hurt. ☆89 Updated 6 months ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper ☆391 Updated 10 months ago
- SpanMarker for Named Entity Recognition ☆429 Updated 4 months ago
- String-to-String Algorithms for Natural Language Processing ☆546 Updated 9 months ago
- This is where I put things I find useful that speed up my work with Machine Learning. Ever looked in your old projects to reuse those coo… ☆259 Updated 3 years ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track) ☆763 Updated 9 months ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum… ☆262 Updated 6 months ago
- Fixes contractions such as `you're` to `you are` ☆318 Updated 2 years ago