[EMNLP 2024] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models
☆47Jan 23, 2025Updated last year
Alternatives and similar repositories for KnowUnDo
Users that are interested in KnowUnDo are comparing it to the libraries listed below
Sorting:
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"☆67Sep 30, 2024Updated last year
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 3 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated last month
- ☆16Feb 8, 2024Updated 2 years ago
- ☆37Oct 18, 2023Updated 2 years ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- [NeurIPS D&B '25] The one-stop repository for LLM unlearning☆502Mar 7, 2026Updated 2 weeks ago
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆33Jun 9, 2024Updated last year
- Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models.☆16Feb 5, 2025Updated last year
- [CCKS 2021] On Robustness and Bias Analysis of BERT-based Relation Extraction☆27Nov 6, 2021Updated 4 years ago
- ☆16May 16, 2025Updated 10 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆24Jul 1, 2025Updated 8 months ago
- [NeurIPS25] Official repo for "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"☆43Oct 3, 2025Updated 5 months ago
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆19Nov 17, 2025Updated 4 months ago
- Automatic Metric for Evaluating Generated Videos☆34Dec 8, 2025Updated 3 months ago
- [ACL 2021] MLBiNet: A Cross-Sentence Collective Event Detection Network☆35Jan 10, 2022Updated 4 years ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆56Apr 15, 2024Updated last year
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…☆26Jun 14, 2024Updated last year
- ☆34Aug 5, 2023Updated 2 years ago
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.☆2,745Mar 4, 2026Updated 2 weeks ago
- [EMNLP 2024] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆147Nov 13, 2024Updated last year
- ☆60Oct 30, 2024Updated last year
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 2 months ago
- ☆14Feb 26, 2025Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 5 months ago
- DeepEE: Deep Event Extraction Algorithm Gallery (基于深度学习的开源中文事件抽取算法汇总)☆43Dec 11, 2022Updated 3 years ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆77Jan 16, 2026Updated 2 months ago
- ☆16Jul 20, 2023Updated 2 years ago
- [NeurIPS 2025] Official Pytorch Implementation of "The Curse of Depth in Large Language Models" by Wenfang Sun, Xinyuan Song, Pengxiang L…☆70Mar 3, 2026Updated 2 weeks ago
- The source code and manually annotated datasets for our paper "Joint Multimodal Sentiment Analysis Based on Information Relevance"☆11Dec 17, 2022Updated 3 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Code for our NAACL2025 accepted paper: Attention Tracker: Detecting Prompt Injection Attacks in LLMs☆23Sep 19, 2025Updated 6 months ago
- ☆20Feb 17, 2020Updated 6 years ago
- Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks☆32Jul 9, 2024Updated last year
- ☆15May 6, 2021Updated 4 years ago
- [NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts☆38Sep 26, 2024Updated last year
- ☆27Feb 25, 2025Updated last year
- Experiments for our CLEAR benchmark of unlearning methods in a multimodal setup☆20Aug 6, 2025Updated 7 months ago