jaechan-repo/muse_bench

☆33

Alternatives and similar repositories for muse_bench

Users who are interested in muse_bench are comparing it to the libraries listed below.

UCSB-NLP-Chang / ULD
Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…
☆26 · Jun 14, 2024 · Updated 2 years ago
swj0419 / muse_bench
☆34 · Mar 13, 2025 · Updated last year
licong-lin / negative-preference-optimization
☆76 · Jul 15, 2024 · Updated 2 years ago
OPTML-Group / Unlearn-Simple
[NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"
☆45 · Oct 3, 2025 · Updated 9 months ago
ethz-spylab / unlearning-vs-safety
☆27 · Oct 6, 2024 · Updated last year
OPTML-Group / Unlearn-Smooth
[ICML25] Official repo for "Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond…
☆24 · Sep 27, 2025 · Updated 10 months ago
zhliu0106 / learning-to-refuse
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10 · Dec 13, 2024 · Updated last year
centerforaisafety / wmdp
WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning m…
☆176 · May 29, 2025 · Updated last year
Lslland / T-Vaccine
☆19 · Jun 21, 2025 · Updated last year
locuslab / open-unlearning
[NeurIPS D&B '25] The one-stop repository for LLM unlearning
☆574 · Mar 18, 2026 · Updated 4 months ago
princeton-nlp / benign-data-breaks-safety
☆47 · Oct 1, 2024 · Updated last year
UCSC-REAL / FLAT
[ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data
☆14 · Feb 26, 2025 · Updated last year
kevinyaobytedance / llm_unlearn
LLM Unlearning
☆185 · Oct 20, 2023 · Updated 2 years ago
UCSB-NLP-Chang / causal_unlearn
[EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"
☆35 · Jul 22, 2024 · Updated 2 years ago
git-disl / Vaccine
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
☆51 · Jan 15, 2026 · Updated 6 months ago
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆100 · Sep 30, 2024 · Updated last year
franciscoliu / Awesome-GenAI-Unlearning
☆188 · Apr 22, 2026 · Updated 3 months ago
EnnengYang / Efficient-WEMoE
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16 · Oct 28, 2024 · Updated last year
HelloEveryboby / Butler
Butler is a tool project for automated service management and task scheduling.
☆17 · Updated this week
sail-sg / closer-look-LLM-unlearning
[ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models
☆49 · Dec 4, 2024 · Updated last year
boyiwei / CoTaEval
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
☆17 · Jul 17, 2024 · Updated 2 years ago
OPTML-Group / WAGLE
Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"
☆19 · Dec 16, 2024 · Updated last year
houseme / sensitive-rs
Sensitive-rs is a Rust library for finding, validating, filtering, and replacing sensitive words. It provides efficient algorithms to han…
☆26 · Jul 22, 2026 · Updated last week
googleinterns / localizing-paragraph-memorization
☆15 · Feb 21, 2024 · Updated 2 years ago
OPTML-Group / Unlearn-WorstCase
[ECCV24] "Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning" by Chongyu Fan*, Jiancheng Liu*, Alfred Hero, …
☆28 · May 27, 2025 · Updated last year
zzwjames / FailureLLMUnlearning
An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)
☆39 · Feb 22, 2025 · Updated last year
felixbinder / introspection_self_prediction
Code for experiments on self-prediction as a way to measure introspection in LLMs
☆16 · Dec 10, 2024 · Updated last year
arobey1 / advbench
☆45 · Mar 3, 2023 · Updated 3 years ago
CharlesYu2000 / PCGU-UnlearningBias
☆17 · Nov 7, 2023 · Updated 2 years ago
paul-rottger / xstest
Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
☆141 · Feb 24, 2025 · Updated last year
1andrevich / antifilter-domain
Generated geosite.dat based on Antifilter Community List
☆29 · Updated this week
naver-ai / negmerge
[ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"
☆16 · Nov 25, 2025 · Updated 8 months ago
graldij / transformer-fusion
Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.
☆31 · Apr 19, 2024 · Updated 2 years ago
XiaoyuXU1 / Representational_Analysis_Tools
☆15 · May 23, 2025 · Updated last year
vinid / safety-tuned-llamas
ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.
☆95 · May 9, 2024 · Updated 2 years ago
phax / en16931-cii2ubl
Converter for EN16931 invoices from CII to UBL
☆45 · Updated this week
if-loops / selective-synaptic-dampening
[AAAI, ICLR TP] Fast Machine Unlearning Without Retraining Through Selective Synaptic Dampening
☆63 · Sep 11, 2024 · Updated last year
somvy / multimodal_unlearning
Experiments for our CLEAR benchmark of unlearning methods in a multimodal setup
☆23 · Aug 6, 2025 · Updated 11 months ago
Babelscape / ALERT
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
☆60 · Sep 20, 2024 · Updated last year