julian-risch / toxic-comment-collection
Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in One Unified Format
☆28Updated 3 years ago
Alternatives and similar repositories for toxic-comment-collection:
Users that are interested in toxic-comment-collection are comparing it to the libraries listed below
- ☆38Updated last year
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆88Updated last year
- Dataset + classifier tools to study social perception biases in natural language generation☆67Updated last year
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆35Updated 3 years ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Updated last year
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆81Updated 4 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆48Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆82Updated last year
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models☆61Updated 2 years ago
- ☆51Updated 2 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Updated 3 years ago
- code associated with ACL 2021 DExperts paper☆114Updated last year
- ☆96Updated last year
- ☆75Updated 3 years ago
- Contrastive Fact Verification☆71Updated 2 years ago
- ☆25Updated 3 years ago
- Automatically detect errors in annotated corpora.☆47Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 8 months ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.☆50Updated 4 years ago
- Code for the paper "Measuring Bias in Contextualized Word Representations"☆35Updated 5 years ago
- ☆71Updated 3 years ago
- To analyze and remove gender bias in coreference resolution systems☆77Updated 3 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark☆54Updated 2 years ago
- The Stanford Word Substitution (Swords) Benchmark☆32Updated 3 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Updated 4 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago