google/belief-localization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google/belief-localization)

google / belief-localization

This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Can Be Injected in Language Models."

☆62

Alternatives and similar repositories for belief-localization

Users that are interested in belief-localization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kmeng01 / memit
View on GitHub
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
☆556Jan 31, 2024Updated 2 years ago
kmeng01 / rome
View on GitHub
Locating and editing factual associations in GPT (NeurIPS 2022)
☆770Apr 20, 2024Updated 2 years ago
feyzaakyurek / dune
View on GitHub
Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.
☆24Sep 4, 2024Updated last year
ECNU-ICALK / MELO
View on GitHub
[AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA
☆28Apr 9, 2024Updated 2 years ago
Thartvigsen / GRACE
View on GitHub
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆86Dec 21, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
edenbiran / RippleEdits
View on GitHub
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆57Apr 15, 2024Updated 2 years ago
vipulgupta1011 / CALM
View on GitHub
☆11Oct 2, 2023Updated 2 years ago
arnab-api / romba
View on GitHub
Applies ROME and MEMIT on Mamba-S4 models
☆16Apr 5, 2024Updated 2 years ago
xszheng2020 / memorization
View on GitHub
An Empirical Study of Memorization in NLP (ACL 2022)
☆13Jun 22, 2022Updated 4 years ago
allenai / few_shot_explanations
View on GitHub
Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"
☆29Apr 28, 2023Updated 3 years ago
HITsz-TMG / ICL-State-Vector
View on GitHub
☆12Jul 4, 2024Updated 2 years ago
ElisaNguyen / bayesian-tda
View on GitHub
Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"
☆17Jan 12, 2024Updated 2 years ago
EleutherAI / knowledge-neurons
View on GitHub
A library for finding knowledge neurons in pretrained transformer models.
☆160Feb 13, 2022Updated 4 years ago
mega002 / ff-layers
View on GitHub
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…
☆103Sep 5, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
eric-mitchell / mend
View on GitHub
MEND: Fast Model Editing at Scale
☆259Aug 30, 2023Updated 2 years ago
thunlp / EREN
View on GitHub
Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1
☆14Mar 27, 2024Updated 2 years ago
QwenLM / ConsisEval
View on GitHub
☆14Jul 5, 2024Updated 2 years ago
yoavgur / PISCES
View on GitHub
🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆13Jun 28, 2026Updated 3 weeks ago
john-hewitt / model-editing-canonical-examples
View on GitHub
☆14Feb 12, 2024Updated 2 years ago
ericwtodd / function_vectors
View on GitHub
Function Vectors in Large Language Models (ICLR 2024)
☆199Apr 30, 2026Updated 2 months ago
mohsenfayyaz / GlobEnc
View on GitHub
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
☆21May 16, 2023Updated 3 years ago
au-revoir / model-editing-ft
View on GitHub
☆13Sep 8, 2024Updated last year
Hunter-DDM / knowledge-neurons
View on GitHub
Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"
☆177May 4, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
CharlesYu2000 / PCGU-UnlearningBias
View on GitHub
☆17Nov 7, 2023Updated 2 years ago
aviclu / ffn-values
View on GitHub
☆67May 18, 2023Updated 3 years ago
mlfoundations / task_vectors
View on GitHub
Editing Models with Task Arithmetic
☆548Jan 11, 2024Updated 2 years ago
visinf / fast-axiomatic-attribution
View on GitHub
Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)
☆15Feb 24, 2026Updated 4 months ago
naver-ai / imagenet-annotation-tool
View on GitHub
☆17Jul 24, 2023Updated 2 years ago
evandez / REMEDI
View on GitHub
Inspecting and Editing Knowledge Representations in Language Models
☆120Jul 24, 2023Updated 2 years ago
zjunlp / KnowledgeCircuits
View on GitHub
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
☆172Nov 14, 2025Updated 8 months ago
huashen218 / convxai
View on GitHub
CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing
☆14Jun 25, 2023Updated 3 years ago
abertsch72 / long-context-icl
View on GitHub
Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"
☆44Aug 20, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
michaelsaxon / CoCoCroLa
View on GitHub
The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models
☆12Oct 28, 2024Updated last year
adiSimhi / Interpreting-Embedding-Spaces-by-Conceptualization
View on GitHub
☆15Oct 17, 2023Updated 2 years ago
benpry / chain-of-thought-metaphor
View on GitHub
This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…
☆14Apr 28, 2023Updated 3 years ago
mt-upc / transformer-contributions-nmt
View on GitHub
☆18Oct 6, 2022Updated 3 years ago
koayon / atp_star
View on GitHub
PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)
☆20Jan 19, 2025Updated last year
GChrysostomou / saloss
View on GitHub
☆11Dec 23, 2021Updated 4 years ago
nicola-decao / KnowledgeEditor
View on GitHub
Code for Editing Factual Knowledge in Language Models
☆142Jan 28, 2022Updated 4 years ago