unitaryai / detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using β‘ Pytorch Lightning and π€ Transformers. For access to our API, please email us at contact@unitary.ai.
β1,025Updated 3 weeks ago
Alternatives and similar repositories for detoxify:
Users that are interested in detoxify are comparing it to the libraries listed below
- Repository for TweetEvalβ367Updated 2 years ago
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.β307Updated 9 months ago
- Catalog of abusive language data (PLoS 2020)β309Updated 9 months ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.β199Updated last year
- π§Ή Python package for text cleaningβ975Updated last year
- NL-Augmenter π¦ β π A Collaborative Repository of Natural Language Transformationsβ781Updated 10 months ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)β589Updated 8 months ago
- StereoSet: Measuring stereotypical bias in pretrained language modelsβ180Updated 2 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherβ¦β1,225Updated last month
- A Python multilingual toolkit for Sentiment Analysis and Social NLP tasksβ585Updated 8 months ago
- NeuSpell: A Neural Spelling Correction Toolkitβ692Updated last year
- Top2Vec learns jointly embedded topic, document and word vectors.β3,021Updated 4 months ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining theβ¦β2,023Updated 7 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.β1,757Updated last month
- skweak: A software toolkit for weak supervision applied to NLP tasksβ923Updated 7 months ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.β1,257Updated 3 weeks ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paperβ390Updated 9 months ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level.β171Updated 4 years ago
- Datasets for Hate Speech Detectionβ125Updated last year
- Active Learning for Text Classification in Pythonβ609Updated this week
- Efficient few-shot learning with Sentence Transformersβ2,424Updated 2 months ago
- Fixes contractions such as `you're` to `you are`β316Updated 2 years ago
- A Python library for calculating a large variety of metrics from textβ334Updated 3 months ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/undβ¦β336Updated 7 months ago
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017β807Updated last year
- ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large cβ¦β575Updated 3 months ago
- Repository for research in the field of Responsible NLP at Meta.β198Updated 4 months ago
- This repository contains a dataset for hate speech detection on social media platforms.β71Updated 2 years ago
- Ask Me Anything language model promptingβ546Updated last year
- utilities for decoding deep representations (like sentence embeddings) back to textβ788Updated 2 months ago