[EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"
☆32Jul 22, 2024Updated last year
Alternatives and similar repositories for causal_unlearn
Users that are interested in causal_unlearn are comparing it to the libraries listed below
Sorting:
- ☆32Aug 9, 2024Updated last year
- ☆27Oct 6, 2024Updated last year
- LLM Unlearning☆182Oct 20, 2023Updated 2 years ago
- ☆17Nov 7, 2023Updated 2 years ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆91Sep 30, 2024Updated last year
- [NeurIPS D&B '25] The one-stop repository for LLM unlearning☆493Feb 18, 2026Updated 2 weeks ago
- Code for Representation Bending Paper☆16Jul 15, 2025Updated 7 months ago
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022☆11Aug 9, 2022Updated 3 years ago
- A resource repository for machine unlearning in large language models☆539Feb 24, 2026Updated last week
- ☆21Jun 22, 2025Updated 8 months ago
- ☆35May 9, 2025Updated 9 months ago
- [NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"☆42Oct 3, 2025Updated 5 months ago
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence☆26Jul 31, 2025Updated 7 months ago
- ☆19Jun 21, 2025Updated 8 months ago
- Erasing conceptual knowledge from language models through low-rank fine-tuning☆19Mar 27, 2025Updated 11 months ago
- This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)☆49Jan 15, 2026Updated last month
- ☆27Feb 25, 2025Updated last year
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆24Mar 25, 2025Updated 11 months ago
- <혼자 만들면서 공부하는 파이썬> 책의 깃허브 자료실☆15Jan 14, 2026Updated last month
- A Task of Fictitious Unlearning for VLMs☆28Apr 6, 2025Updated 10 months ago
- ☆26Nov 25, 2023Updated 2 years ago
- [ECCV24] "Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning" by Chongyu Fan*, Jiancheng Liu*, Alfred Hero, …☆25May 27, 2025Updated 9 months ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 3 years ago
- [ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.…☆28Sep 25, 2024Updated last year
- ☆73Jul 15, 2024Updated last year
- ☆28May 4, 2023Updated 2 years ago
- Official repository of the paper: Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code (Findings of EACL …☆12Feb 11, 2026Updated 3 weeks ago
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆29Oct 1, 2024Updated last year
- WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning m…☆160May 29, 2025Updated 9 months ago
- ☆31May 1, 2025Updated 10 months ago
- Repository of the COLING 2022 paper : Ordinal Log-Loss - A simple log-based loss function for ordinal text classification.☆31Mar 17, 2023Updated 2 years ago
- A Data-Driven Approach to Predict the Success of Bank Telemarketing☆10Apr 27, 2021Updated 4 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆113Updated this week
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models☆45Dec 4, 2024Updated last year
- ☆185Nov 17, 2025Updated 3 months ago
- Implementation of LaViC (KDD 2025)☆12Jun 1, 2025Updated 9 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- ☆24Feb 18, 2026Updated 2 weeks ago