scalable-model-editing / unified-model-editing
We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.
☆21 · Updated 5 months ago
Alternatives and similar repositories for unified-model-editing
Users who are interested in unified-model-editing are comparing it to the libraries listed below.
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue ☆35 · Updated last week
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity… ☆26 · Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆89 · Updated last week
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆44 · Updated last month
- ☆34 · Updated 2 weeks ago
- Function Vectors in Large Language Models (ICLR 2024) ☆167 · Updated last month
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆56 · Updated 2 months ago
- [ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely ☆23 · Updated 11 months ago
- ☆35 · Updated 3 months ago
- ☆24 · Updated last year
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025) ☆27 · Updated 3 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆72 · Updated 2 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆76 · Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆35 · Updated 7 months ago
- Codebase for Instruction Following without Instruction Tuning ☆34 · Updated 8 months ago
- Algebraic value editing in pretrained language models ☆65 · Updated last year
- ☆17 · Updated 5 months ago
- ☆37 · Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆110 · Updated last year
- A library for efficient patching and automatic circuit discovery. ☆65 · Updated last month
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs" ☆23 · Updated last month
- General-purpose activation steering library ☆75 · Updated 3 weeks ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca… ☆61 · Updated 2 years ago
- This is the official project for our paper "Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers" ☆30 · Updated last year
- ☆51 · Updated last month
- ☆19 · Updated 10 months ago
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery" ☆34 · Updated last year
- Self-Supervised Alignment with Mutual Information ☆19 · Updated last year
- ☆32 · Updated 4 months ago
- Code repository for the paper "Mission: Impossible Language Models." ☆52 · Updated last month