The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.
☆22Jun 10, 2026Updated last week
Alternatives and similar repositories for toxic-commons
Users that are interested in toxic-commons are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Feb 25, 2025Updated last year
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)☆19May 17, 2022Updated 4 years ago
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- our modeling of online misogyny☆11Jun 22, 2022Updated 3 years ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆39May 2, 2026Updated last month
- CounterGeDi is a pipeline that aims at controlling the counter speech generated to make it emotional, polite and detoxified. Paper accept…☆11Jul 19, 2022Updated 3 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- Tool for parsing English phonemes into syllables.☆10Jan 15, 2018Updated 8 years ago
- Data for the HIPE 2022 shared task.☆23May 15, 2026Updated last month
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated 2 years ago
- Script to get ACL Anthology☆16Jan 2, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆23Aug 13, 2023Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Jul 28, 2022Updated 3 years ago
- ☆23Jun 2, 2026Updated 2 weeks ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Frozen Pretrained Transformers for Neural Sign Language Translation☆15Apr 23, 2022Updated 4 years ago
- An agent-based model for scientific inquiry based on abstract argumentation☆13Jan 17, 2022Updated 4 years ago
- String Distances in rust☆14Nov 21, 2022Updated 3 years ago
- XL-AMR is a sequence-to-graph cross-lingual AMR parser that exploits transfer learning (EMNLP2020).☆17Jul 25, 2024Updated last year
- ☆10Dec 17, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Boilerplate application for the Chaplin.js library☆80Jan 23, 2014Updated 12 years ago
- Laravel Config with DB-storage support☆21Oct 15, 2018Updated 7 years ago
- ☆10Oct 2, 2024Updated last year
- ☆13Jun 16, 2021Updated 5 years ago
- Bilingual sentence similarity classifier using Tensorflow☆24Sep 26, 2019Updated 6 years ago
- decontamination☆33Mar 4, 2026Updated 3 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- An LLM Client for the PS Vita☆13Jun 23, 2025Updated 11 months ago
- German Alpaca Dataset (Cleaned + Translated)☆26Apr 6, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆76Apr 1, 2025Updated last year
- Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!☆12Jun 12, 2023Updated 3 years ago
- Official implemtation of UniverSR (ICASSP 2026)☆53Apr 9, 2026Updated 2 months ago
- ☆14Mar 2, 2023Updated 3 years ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 9 months ago
- A library for soft differentiable relaxations of common PyTorch functions.☆76Mar 14, 2026Updated 3 months ago
- Export Apple News saved articles to SQLite☆14Mar 16, 2023Updated 3 years ago