Official GitHub page for the paper "Evaluating Deep Unlearning in Large Language Model"
☆14 · Apr 25, 2025 · Updated 10 months ago
Alternatives and similar repositories for deep_unlearning
Users that are interested in deep_unlearning are comparing it to the libraries listed below
- ☆10 · Aug 13, 2021 · Updated 4 years ago
- Code repo for the NeurIPS 2021 paper "Online Adaptation to Label Distribution Shift". ☆15 · Feb 15, 2023 · Updated 3 years ago
- Code repo for the UAI 2023 paper "Learning To Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning". ☆16 · Jun 15, 2024 · Updated last year
- ☆15 · Dec 9, 2018 · Updated 7 years ago
- ☆37 · Oct 18, 2023 · Updated 2 years ago
- ☆22 · Dec 17, 2025 · Updated 3 months ago
- ☆21 · Jul 25, 2024 · Updated last year
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models" ☆67 · Sep 30, 2024 · Updated last year
- 🤖 AI debate system based on AutoGen | 🗣️ Supports Chinese interaction | 🔄 Multi-agent collaboration | 📝 Automatically records the debate process ☆22 · Mar 4, 2026 · Updated 2 weeks ago
- [ICLR 2025 Spotlight] LayerDAG: A Layerwise Autoregressive Diffusion Model of Directed Acyclic Graphs ☆26 · Jan 26, 2025 · Updated last year
- ☆40 · Nov 4, 2024 · Updated last year
- Certified (approximate) machine unlearning for simplified graph convolutional networks (SGCs) with theoretical guarantees (ICLR 2023) ☆20 · Feb 17, 2023 · Updated 3 years ago
- ☆19 · Sep 15, 2022 · Updated 3 years ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives ☆70 · Feb 22, 2024 · Updated 2 years ago
- ☆24 · Apr 29, 2022 · Updated 3 years ago
- A holistic benchmark for LLM abstention ☆73 · Aug 27, 2025 · Updated 6 months ago
- BiasFinder | IEEE TSE | Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems ☆11 · Jan 18, 2022 · Updated 4 years ago
- ☆26 · Apr 9, 2019 · Updated 6 years ago
- [NeurIPS D&B '25] The one-stop repository for LLM unlearning ☆505 · Updated this week
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆13 · Dec 5, 2023 · Updated 2 years ago
- [TKDE 2024, CIKM 2022] SLA²P: Self-supervised Anomaly Detection with Adversarial Perturbation. ☆39 · Dec 26, 2024 · Updated last year
- ☆14 · Oct 11, 2017 · Updated 8 years ago
- Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks ☆32 · Jul 9, 2024 · Updated last year
- ☆37 · Sep 23, 2021 · Updated 4 years ago
- ☆30 · Sep 28, 2023 · Updated 2 years ago
- Blueprint for federated finetuning, enabling multiple data owners to collaboratively fine-tune models without sharing raw data. Developed… ☆39 · Jan 22, 2026 · Updated 2 months ago
- White-box Fairness Testing through Adversarial Sampling ☆14 · Apr 16, 2021 · Updated 4 years ago
- Improving Machine Translation Systems via Isotopic Replacement ☆12 · Apr 14, 2023 · Updated 2 years ago
- [EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective" ☆33 · Jul 22, 2024 · Updated last year
- ☆27 · Aug 1, 2024 · Updated last year
- ☆13 · Nov 2, 2022 · Updated 3 years ago
- Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings ☆18 · Sep 1, 2025 · Updated 6 months ago
- [CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge ☆39 · Sep 17, 2025 · Updated 6 months ago
- Code showing how to use a model based on the ML model base class. ☆10 · Sep 30, 2022 · Updated 3 years ago
- Tests that check correctness of a single statement ☆14 · Nov 25, 2024 · Updated last year
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment" ☆17 · Feb 26, 2026 · Updated 3 weeks ago
- Code for "Astraea: Grammar-based Fairness Testing" ☆10 · Jan 7, 2022 · Updated 4 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal… ☆12 · Sep 16, 2024 · Updated last year
- A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization ☆17 · Dec 15, 2025 · Updated 3 months ago