Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"
☆25May 30, 2024Updated last year
Alternatives and similar repositories for goodtriever
Users that are interested in goodtriever are comparing it to the libraries listed below
Sorting:
- ☆32Mar 20, 2024Updated last year
- ☆29Aug 9, 2023Updated 2 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆30Nov 25, 2021Updated 4 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Python T-Digest Module☆10Aug 17, 2015Updated 10 years ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Oct 21, 2020Updated 5 years ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- A repository of social media posts related to the Italian 2022 general election.☆10Oct 23, 2023Updated 2 years ago
- Replication code for "The Structure of Toxic Conversations on Twitter" (WWW'21)☆10May 25, 2021Updated 4 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- A character-level BERT for Ancient Greek☆10Jul 22, 2023Updated 2 years ago
- ☆15Mar 20, 2025Updated 11 months ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- ☆11Oct 8, 2023Updated 2 years ago
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Apr 11, 2024Updated last year
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆13Dec 16, 2024Updated last year
- Scala port of the word2vec toolkit.☆11Aug 15, 2016Updated 9 years ago
- The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition☆11Apr 28, 2024Updated last year
- ☆13Oct 3, 2024Updated last year
- [ICML 2023] Protecting Language Generation Models via Invisible Watermarking☆13Sep 8, 2023Updated 2 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- Complete set of English dialect transformation rules and evaluation code☆16Jun 7, 2024Updated last year
- Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification☆10May 31, 2022Updated 3 years ago
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- Analyzing Latent Concept in Pre-trained Transformer Models☆12Jul 18, 2022Updated 3 years ago
- Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)☆10Jun 18, 2019Updated 6 years ago
- 中文原生等级化代码能力测试基准☆15Apr 11, 2024Updated last year
- ☆11Oct 16, 2023Updated 2 years ago
- A collection of cross-platform social media posts about the 2022 U.S. midterm elections☆13Sep 20, 2024Updated last year
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 3 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Nov 22, 2021Updated 4 years ago
- ☆14Sep 1, 2025Updated 6 months ago
- [NLPCC 2024] Shared Task 10: Regulating Large Language Models☆14Jun 12, 2024Updated last year
- Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)☆10Jun 6, 2023Updated 2 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- ☆12Sep 23, 2024Updated last year