kevinyaobytedance/llm_unlearn

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kevinyaobytedance/llm_unlearn)

kevinyaobytedance / llm_unlearn

LLM Unlearning

☆185

Alternatives and similar repositories for llm_unlearn

Users that are interested in llm_unlearn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yaojin17 / Unlearning_LLM
View on GitHub
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
☆68Sep 30, 2024Updated last year
franciscoliu / SKU
View on GitHub
Official code implementation of SKU, Accepted by ACL 2024 Findings
☆20Dec 18, 2024Updated last year
chrisliu298 / awesome-llm-unlearning
View on GitHub
A resource repository for machine unlearning in large language models
☆617Updated this week
OPTML-Group / SOUL
View on GitHub
Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"
☆30Oct 1, 2024Updated last year
jaechan-repo / muse_bench
View on GitHub
☆33Aug 9, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OPTML-Group / Unlearn-Simple
View on GitHub
[NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"
☆45Oct 3, 2025Updated 9 months ago
SALT-NLP / Efficient_Unlearning
View on GitHub
☆38Oct 18, 2023Updated 2 years ago
locuslab / open-unlearning
View on GitHub
[NeurIPS D&B '25] The one-stop repository for LLM unlearning
☆571Mar 18, 2026Updated 4 months ago
UCSB-NLP-Chang / causal_unlearn
View on GitHub
[EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"
☆35Jul 22, 2024Updated 2 years ago
yihuaihong / ConceptVectors
View on GitHub
[EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"
☆40Aug 20, 2025Updated 11 months ago
licong-lin / negative-preference-optimization
View on GitHub
☆76Jul 15, 2024Updated 2 years ago
franciscoliu / Awesome-GenAI-Unlearning
View on GitHub
☆188Apr 22, 2026Updated 3 months ago
ethz-spylab / unlearning-vs-safety
View on GitHub
☆27Oct 6, 2024Updated last year
sail-sg / closer-look-LLM-unlearning
View on GitHub
[ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models
☆49Dec 4, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
chrisliu298 / llm-unlearn-eco
View on GitHub
[NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts
☆41Sep 26, 2024Updated last year
KID-22 / LLM-Unlearning-Paper-List
View on GitHub
☆28Dec 18, 2025Updated 7 months ago
jinzhuoran / RWKU
View on GitHub
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆100Sep 30, 2024Updated last year
centerforaisafety / wmdp
View on GitHub
WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning m…
☆176May 29, 2025Updated last year
joeljang / knowledge-unlearning
View on GitHub
[ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models
☆89Sep 12, 2024Updated last year
git-disl / Vaccine
View on GitHub
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
☆51Jan 15, 2026Updated 6 months ago
boyiwei / CoTaEval
View on GitHub
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
☆17Jul 17, 2024Updated 2 years ago
zhliu0106 / learning-to-refuse
View on GitHub
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10Dec 13, 2024Updated last year
Lslland / T-Vaccine
View on GitHub
☆19Jun 21, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
MartinPawelczyk / In-Context-Unlearning
View on GitHub
"In-Context Unlearning: Language Models as Few Shot Unlearners". Martin Pawelczyk, Seth Neel* and Himabindu Lakkaraju*; ICML 2024.
☆30Oct 18, 2023Updated 2 years ago
princeton-nlp / benign-data-breaks-safety
View on GitHub
☆47Oct 1, 2024Updated last year
tamlhp / awesome-machine-unlearning
View on GitHub
Awesome Machine Unlearning (A Survey of Machine Unlearning)
☆964Jul 6, 2026Updated 2 weeks ago
Jimmy-di / camouflage-poisoning
View on GitHub
Camouflage poisoning via machine unlearning
☆19Jul 3, 2025Updated last year
Lingzhi-WANG / KGAUnlearn
View on GitHub
☆19Sep 10, 2023Updated 2 years ago
alewarne / MachineUnlearning
View on GitHub
Code related to the paper "Machine Unlearning of Features and Labels"
☆71Feb 13, 2024Updated 2 years ago
domenicrosati / representation-noising
View on GitHub
Code to replicate the Representation Noising paper and tools for evaluating defences against harmful fine-tuning
☆24Dec 12, 2024Updated last year
nrimsky / InfluenceFunctions
View on GitHub
Implementation of Influence Function approximations for differently sized ML models, using PyTorch
☆18Sep 15, 2023Updated 2 years ago
OPTML-Group / WAGLE
View on GitHub
Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"
☆19Dec 16, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
jjbrophy47 / machine_unlearning
View on GitHub
Existing Literature about Machine Unlearning
☆967Aug 29, 2025Updated 10 months ago
ajyl / dpo_toxic
View on GitHub
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆90Mar 7, 2025Updated last year
AngelaZZZ-611 / reasoning_models_probing
View on GitHub
☆22May 14, 2026Updated 2 months ago
yihuaihong / Dissecting-FT-Unlearning
View on GitHub
[EMNLP 2024 Main] Code for the paper "Dissecting Fine-Tuning Unlearning in Large Language Models"
☆14Oct 10, 2024Updated last year
LLM-Tuning-Safety / LLMs-Finetuning-Safety
View on GitHub
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20…
☆358Feb 23, 2024Updated 2 years ago
GraySwanAI / circuit-breakers
View on GitHub
Improving Alignment and Robustness with Circuit Breakers
☆266Sep 24, 2024Updated last year
franciscoliu / MLLMU-Bench
View on GitHub
[NAACL 2025 Main] Official Implementation of MLLMU-Bench
☆55Mar 13, 2025Updated last year