joonkeekim / hare-hate-speech
Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023
☆21Updated last year
Alternatives and similar repositories for hare-hate-speech:
Users that are interested in hare-hate-speech are comparing it to the libraries listed below
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆79Updated 5 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆28Updated 3 months ago
- ☆20Updated last year
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆70Updated 9 months ago
- Code and data for Marked Personas (ACL 2023)☆22Updated last year
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Updated 4 months ago
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…☆39Updated last year
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆25Updated 2 months ago
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 2 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆131Updated 2 months ago
- official repository for ListT5☆43Updated last week
- ☆124Updated last year
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models☆92Updated 2 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆84Updated 3 months ago
- ☆24Updated last year
- Generalizable Implicit Hate Speech Detection using Contrastive Learning (COLING 2022)☆13Updated 2 years ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆20Updated 11 months ago
- ☆15Updated last year
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Updated last year
- Dataset and code for paper: "Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese".☆16Updated 3 months ago
- ☆25Updated 5 months ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆71Updated 3 years ago
- ☆10Updated 5 months ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆164Updated 3 years ago
- Official Implementation for "Self-Gudied Contrastive Learning for BERT Sentence Representations (ACL 2021)"☆27Updated 2 years ago
- KOLD: Korean Offensive Language Dataset☆80Updated 2 years ago
- ☆38Updated last year
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"☆18Updated 2 years ago
- Frequency-Guided Word Substitutions for Detecting Textual Adversarial Examples (EACL 2021)☆8Updated 3 years ago