Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in One Unified Format
☆30Nov 25, 2021Updated 4 years ago
Alternatives and similar repositories for toxic-comment-collection
Users that are interested in toxic-comment-collection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- Source Code - https://github.com/USStateDept/State-TalentMAP☆13Sep 12, 2023Updated 2 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆238Jun 12, 2023Updated 2 years ago
- Rekhta Dictionary Extension code☆10Aug 9, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- ☆24Apr 2, 2024Updated 2 years ago
- Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022)☆20Jul 18, 2022Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"☆25May 30, 2024Updated last year
- ☆19Feb 22, 2024Updated 2 years ago
- ☆10Jan 5, 2022Updated 4 years ago
- Stanford Internet Observatory publications☆14Dec 2, 2021Updated 4 years ago
- 2020 Summer Olympics medals per million people☆12Aug 8, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models"☆17May 23, 2022Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- ☆12Sep 13, 2018Updated 7 years ago
- Code for FACTOID dataset paper in LREC 2022☆18Dec 19, 2022Updated 3 years ago
- SECURE: Benchmarking Generative Large Language Models as a Cyber Advisory☆17Aug 28, 2024Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆82Apr 11, 2024Updated 2 years ago
- Official repository of the Hate Speech Detection Tasks at Evalita☆12Dec 16, 2020Updated 5 years ago
- Addressing common clinical biases in medical language models☆16Jul 27, 2024Updated last year
- CounterGeDi is a pipeline that aims at controlling the counter speech generated to make it emotional, polite and detoxified. Paper accept…☆11Jul 19, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Notebooks, slides and dataset of the CorrelAid Machine Learning Winter School☆12Jul 13, 2022Updated 3 years ago
- ☆10Sep 17, 2022Updated 3 years ago
- MRP - Multilevel + BART = BARP☆10Jan 1, 2023Updated 3 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago
- Release of the ConditionalQA dataset☆21Nov 2, 2021Updated 4 years ago
- #TidyTuesday is a weekly social data project in R which encourages participants to summarize and arrange data to make meaningful charts w…☆14Jun 10, 2025Updated 10 months ago
- DeClarE: Debunking Fake News and False Claims using Evidence-Aware Deep Learning☆23Aug 23, 2023Updated 2 years ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Oct 21, 2020Updated 5 years ago
- Fine-tuned transformers for protest event detection.☆11Mar 9, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- TGLS: Unsupervised Text Generation by Learning from Search☆25Jan 5, 2021Updated 5 years ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆21Mar 7, 2024Updated 2 years ago
- Arabic Dialect Identification on AOC data.☆24Mar 2, 2019Updated 7 years ago
- MoLE: Cross-Domain Label-Adaptive Stance Detection☆18Mar 3, 2022Updated 4 years ago
- Perform network analysis on reddit☆11Jun 18, 2019Updated 6 years ago
- ☆106Oct 16, 2025Updated 5 months ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year