Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in One Unified Format
☆30 · Nov 25, 2021 · Updated 4 years ago
Alternatives and similar repositories for toxic-comment-collection
Users that are interested in toxic-comment-collection are comparing it to the libraries listed below.
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati… ☆10 · Sep 23, 2023 · Updated 2 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question. ☆236 · Jun 12, 2023 · Updated 2 years ago
- Documenting large text datasets 🖼️ 📚 ☆14 · Dec 17, 2024 · Updated last year
- Explaining neural decisions contrastively to alternative decisions. ☆24 · Mar 18, 2021 · Updated 5 years ago
- Hugging Face and Pyserini interoperability ☆19 · May 18, 2023 · Updated 2 years ago
- A repo to keep all resources about interpretability in NLP organised and up to date ☆12 · Nov 22, 2020 · Updated 5 years ago
- ☆24 · Apr 2, 2024 · Updated last year
- Implementation of unregularized, l1 regularized and l2 regularized linear regression using numpy and without sklearn ☆12 · Oct 4, 2019 · Updated 6 years ago
- Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022) ☆20 · Jul 18, 2022 · Updated 3 years ago
- ☆23 · Jun 12, 2023 · Updated 2 years ago
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models" ☆25 · May 30, 2024 · Updated last year
- Detect toxic spans in toxic texts ☆70 · Jun 12, 2023 · Updated 2 years ago
- ☆10 · Jan 5, 2022 · Updated 4 years ago
- A Dataset and Results for Classifying Emotions Across Languages ☆10 · Jun 20, 2021 · Updated 4 years ago
- Stanford Internet Observatory publications ☆14 · Dec 2, 2021 · Updated 4 years ago
- 2020 Summer Olympics medals per million people ☆12 · Aug 8, 2021 · Updated 4 years ago
- Catalog of abusive language data (PLoS 2020) ☆323 · Jun 14, 2024 · Updated last year
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models" ☆17 · May 23, 2022 · Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆44 · Aug 10, 2024 · Updated last year
- Causal Mediation analysis ☆10 · Dec 26, 2025 · Updated 2 months ago
- Code for FACTOID dataset paper in LREC 2022 ☆18 · Dec 19, 2022 · Updated 3 years ago
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents ☆16 · Sep 16, 2025 · Updated 6 months ago
- ☆20 · Dec 16, 2020 · Updated 5 years ago
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings) ☆15 · May 14, 2023 · Updated 2 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆39 · Jan 12, 2024 · Updated 2 years ago
- Addressing common clinical biases in medical language models ☆16 · Jul 27, 2024 · Updated last year
- CounterGeDi is a pipeline that aims at controlling the counter speech generated to make it emotional, polite and detoxified. Paper accept… ☆11 · Jul 19, 2022 · Updated 3 years ago
- Official repository of the Hate Speech Detection Tasks at Evalita ☆12 · Dec 16, 2020 · Updated 5 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems" ☆21 · Jul 18, 2023 · Updated 2 years ago
- MRP - Multilevel + BART = BARP ☆10 · Jan 1, 2023 · Updated 3 years ago
- #TidyTuesday is a weekly social data project in R which encourages participants to summarize and arrange data to make meaningful charts w… ☆14 · Jun 10, 2025 · Updated 9 months ago
- Release of the ConditionalQA dataset ☆21 · Nov 2, 2021 · Updated 4 years ago
- DeClarE: Debunking Fake News and False Claims using Evidence-Aware Deep Learning ☆23 · Aug 23, 2023 · Updated 2 years ago
- Towards cross-lingual distributed representations without parallel text trained with adversarial autoencoders ☆22 · Aug 11, 2016 · Updated 9 years ago
- Fine-tuned transformers for protest event detection. ☆11 · Mar 9, 2021 · Updated 5 years ago
- Workshop Materials "Advanced Bayesian Statistical Modeling in R and Stan" ☆12 · Nov 23, 2023 · Updated 2 years ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat) ☆10 · Apr 17, 2023 · Updated 2 years ago
- TGLS: Unsupervised Text Generation by Learning from Search ☆25 · Jan 5, 2021 · Updated 5 years ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp… ☆21 · Mar 7, 2024 · Updated 2 years ago