joonkeekim / hare-hate-speech
Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023
☆21Updated 11 months ago
Alternatives and similar repositories for hare-hate-speech:
Users that are interested in hare-hate-speech are comparing it to the libraries listed below
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆79Updated 4 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆28Updated last month
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆69Updated 8 months ago
- ☆29Updated 2 years ago
- ☆118Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆128Updated last month
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…☆36Updated last year
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models☆92Updated 2 years ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆100Updated 2 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks…☆37Updated last month
- ☆20Updated last year
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆22Updated last month
- ☆36Updated last year
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 2 years ago
- ☆24Updated 3 months ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆164Updated 3 years ago
- Code and data for Marked Personas (ACL 2023)☆21Updated last year
- ☆36Updated last year
- Automatically Update NLP Papers Daily using Github Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily)☆89Updated this week
- ☆57Updated 3 weeks ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆20Updated 10 months ago
- Dataset and code for paper: "Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese".☆16Updated last month
- Generalizable Implicit Hate Speech Detection using Contrastive Learning (COLING 2022)☆13Updated 2 years ago
- Official Implementation for "Self-Gudied Contrastive Learning for BERT Sentence Representations (ACL 2021)"☆26Updated 2 years ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆14Updated last year
- Official Code for the paper "SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs" (ICLR 2024)☆21Updated 8 months ago
- ☆15Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆102Updated 9 months ago
- ☆21Updated 3 months ago