joonkeekim / hare-hate-speech
Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023
☆24Updated last year
Alternatives and similar repositories for hare-hate-speech:
Users that are interested in hare-hate-speech are comparing it to the libraries listed below
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆80Updated 7 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆28Updated 5 months ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆31Updated 4 months ago
- ☆10Updated last year
- ☆20Updated last year
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆71Updated 11 months ago
- Official Code for EMNLP 2023 paper: "Unveiling the Implicit Toxicity in Large Language Models""☆11Updated last year
- ☆131Updated last year
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆167Updated 3 years ago
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"☆18Updated 2 years ago
- ☆30Updated 2 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Updated 6 months ago
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models☆92Updated 2 years ago
- Generalizable Implicit Hate Speech Detection using Contrastive Learning (COLING 2022)☆13Updated 2 years ago
- Official Implementation for "Self-Gudied Contrastive Learning for BERT Sentence Representations (ACL 2021)"☆27Updated 2 years ago
- ☆25Updated 7 months ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆137Updated 4 months ago
- ☆38Updated last year
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated 2 years ago
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…☆42Updated last year
- ☆36Updated 2 years ago
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 2 years ago
- ☆11Updated last year
- ☆34Updated 2 years ago
- official repository for ListT5☆44Updated 2 months ago
- ☆15Updated 2 years ago
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆49Updated 11 months ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆80Updated 4 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- Code and data for Marked Personas (ACL 2023)☆23Updated last year