rohitgandikota / erasing-llm
Erasing conceptual knowledge from language models through low-rank fine-tuning
☆17 Updated last month
Alternatives and similar repositories for erasing-llm:
Users interested in erasing-llm are comparing it to the repositories listed below
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024) ☆72 Updated last year
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning ☆18 Updated 8 months ago
- What do we learn from inverting CLIP models? ☆54 Updated last year
- [NeurIPS 2024 D&B Track] UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models by Yihua Zhang, Cho… ☆68 Updated 5 months ago
- Unified Concept Editing in Diffusion Models ☆151 Updated last week
- [TMLR 2025] On Memorization in Diffusion Models ☆24 Updated last year
- ☆38 Updated 8 months ago
- Unlearning in Diffusion Models using Sparse Autoencoders ☆20 Updated last month
- ☆62 Updated 7 months ago
- The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns… ☆73 Updated 2 months ago
- ☆27 Updated 2 weeks ago
- Official Implementation of Safe Latent Diffusion for Text2Image ☆86 Updated 2 years ago
- ☆27 Updated last year
- Official PyTorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models. ☆105 Updated last year
- Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Model… ☆39 Updated 6 months ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024) ☆28 Updated last year
- Official PyTorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25) ☆67 Updated 2 months ago
- [ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models ☆129 Updated 5 months ago
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images." ☆15 Updated 2 months ago
- PDM-based Purifier ☆20 Updated 6 months ago
- Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness ☆41 Updated last year
- ☆29 Updated 3 months ago
- ☆13 Updated 2 months ago
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models ☆23 Updated 2 months ago
- ☆11 Updated 5 months ago
- Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation" ☆12 Updated last week
- The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Wor… ☆22 Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆35 Updated 6 months ago
- Sparse autoencoders for vision ☆28 Updated last week
- ☆20 Updated last year