Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in One Unified Format
☆30Nov 25, 2021Updated 4 years ago
Alternatives and similar repositories for toxic-comment-collection
Users that are interested in toxic-comment-collection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- Source Code - https://github.com/USStateDept/State-TalentMAP☆13Sep 12, 2023Updated 2 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆239Jun 12, 2023Updated 2 years ago
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Explaining neural decisions contrastively to alternative decisions.☆24Mar 18, 2021Updated 5 years ago
- Rekhta Dictionary Extension code☆10Aug 9, 2022Updated 3 years ago
- Hugging Face and Pyserini interoperability☆19May 18, 2023Updated 3 years ago
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- Implementation of unregularized, l1 regularized and l2 regularized linear regression using numpy and without sklearn☆12Oct 4, 2019Updated 6 years ago
- Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022)☆20Jul 18, 2022Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- Detect toxic spans in toxic texts☆70Jun 12, 2023Updated 2 years ago
- ☆19Feb 22, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A Dataset and Results for Classifying Emotions Across Languages☆10Jun 20, 2021Updated 4 years ago
- Stanford Internet Observatory publications☆14Dec 2, 2021Updated 4 years ago
- 2020 Summer Olympics medals per million people☆12Aug 8, 2021Updated 4 years ago
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models"☆17May 23, 2022Updated 4 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆45Aug 10, 2024Updated last year
- Catalog of abusive language data (PLoS 2020)☆326Jun 14, 2024Updated last year
- ☆12Sep 13, 2018Updated 7 years ago
- Causal Mediation analysis☆10May 12, 2026Updated last week
- Code for FACTOID dataset paper in LREC 2022☆18Dec 19, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- SECURE: Benchmarking Generative Large Language Models as a Cyber Advisory☆17Aug 28, 2024Updated last year
- ☆20Dec 16, 2020Updated 5 years ago
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings)☆15May 14, 2023Updated 3 years ago
- Official repository of the Hate Speech Detection Tasks at Evalita☆12Dec 16, 2020Updated 5 years ago
- Addressing common clinical biases in medical language models☆16Jul 27, 2024Updated last year
- CounterGeDi is a pipeline that aims at controlling the counter speech generated to make it emotional, polite and detoxified. Paper accept…☆11Jul 19, 2022Updated 3 years ago
- Notebooks, slides and dataset of the CorrelAid Machine Learning Winter School☆12Jul 13, 2022Updated 3 years ago
- ☆10Sep 17, 2022Updated 3 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- #TidyTuesday is a weekly social data project in R which encourages participants to summarize and arrange data to make meaningful charts w…