jianghoucheng / AlphaEditView external linksLinks
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
☆415Oct 15, 2025Updated 3 months ago
Alternatives and similar repositories for AlphaEdit
Users that are interested in AlphaEdit are comparing it to the libraries listed below
Sorting:
- AnyEdit: Edit Any Knowledge Encoded in Language Models, ICML 2025☆45Nov 6, 2025Updated 3 months ago
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing☆36Aug 19, 2024Updated last year
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.☆2,711Feb 4, 2026Updated last week
- Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated 2 weeks ago
- ☆205Dec 23, 2025Updated last month
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 8 months ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆538Jan 31, 2024Updated 2 years ago
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,212Jul 12, 2025Updated 7 months ago
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1☆14Mar 27, 2024Updated last year
- [CVPR 2025] Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts☆22Jun 22, 2025Updated 7 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆30Jun 23, 2025Updated 7 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆25Dec 16, 2024Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16May 4, 2024Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆119Sep 12, 2024Updated last year
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆38May 26, 2025Updated 8 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆173Apr 23, 2025Updated 9 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆56Apr 15, 2024Updated last year
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 4 months ago
- Locating and editing factual associations in GPT (NeurIPS 2022)☆724Apr 20, 2024Updated last year
- This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturba…☆36Mar 22, 2025Updated 10 months ago
- Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …☆81Feb 6, 2026Updated last week
- ☆20Feb 3, 2025Updated last year
- ☆13Sep 8, 2024Updated last year
- ☆10Apr 23, 2025Updated 9 months ago
- Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"☆25Dec 21, 2025Updated last month
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆83Dec 21, 2024Updated last year
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆52Apr 6, 2025Updated 10 months ago
- [ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications☆89Mar 30, 2025Updated 10 months ago
- Source code for the paper 'Uncovering Neural Scaling Laws in Molecular Representation Learning' (NeurIPS 2023 Datasets and Benchmarks).☆14Dec 2, 2023Updated 2 years ago
- The first toolkit for MLRM safety evaluation, providing unified interface for mainstream models, datasets, and jailbreaking methods!☆14Apr 8, 2025Updated 10 months ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 7 months ago
- Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…☆23Jul 26, 2024Updated last year
- [SIGIR 2025] implementation of AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings☆38Apr 15, 2025Updated 9 months ago
- Official code for the ICCV2023 paper ``One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training''☆20Aug 9, 2023Updated 2 years ago
- [ACL 2024] ReactXT: Understanding Molecular “Reaction-ship” via Reaction-Contextualized Molecule-Text Pretraining. by Zhiyuan Liu*, Yaoru…☆27Sep 3, 2024Updated last year
- ☆22Apr 23, 2024Updated last year
- Code and dataset for the paper: "Can Editing LLMs Inject Harm?"☆21Dec 26, 2025Updated last month
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated 8 months ago
- [NDSS'25] The official implementation of safety misalignment.☆17Jan 8, 2025Updated last year