mega002/lm-debugger

The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.

☆186

Alternatives and similar repositories for lm-debugger

Users who are interested in lm-debugger are comparing it to the repositories listed below.

aviclu / ffn-values
☆67 · May 18, 2023 · Updated 3 years ago
mega002 / ff-layers
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…
☆103 · Sep 5, 2021 · Updated 4 years ago
allenai / few_shot_explanations
Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"
☆29 · Apr 28, 2023 · Updated 3 years ago
yoavgur / PISCES
🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆13 · Jun 28, 2026 · Updated 3 weeks ago
kanishkamisra / wugs-and-daxes
Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th…
☆22 · Mar 20, 2024 · Updated 2 years ago
google / belief-localization
This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…
☆62 · May 9, 2023 · Updated 3 years ago
kmeng01 / rome
Locating and editing factual associations in GPT (NeurIPS 2022)
☆770 · Apr 20, 2024 · Updated 2 years ago
guy-dar / embedding-space
☆58 · Jun 15, 2023 · Updated 3 years ago
wietsedv / gpt2-recycle
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
☆48 · Aug 2, 2021 · Updated 4 years ago
yanaiela / amnesic_probing
☆40 · Jun 19, 2021 · Updated 5 years ago
Leukas / CUTE
☆20 · Apr 26, 2026 · Updated 2 months ago
apartresearch / specificityplus
👩‍💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"
☆20 · Jan 19, 2024 · Updated 2 years ago
arnab-api / romba
Applies ROME and MEMIT on Mamba-S4 models
☆16 · Apr 5, 2024 · Updated 2 years ago
Alrope123 / prompt-waywardness
☆14 · Apr 27, 2022 · Updated 4 years ago
TomFrederik / unseal
Mechanistic Interpretability for Transformer Models
☆53 · Jun 1, 2022 · Updated 4 years ago
evandez / REMEDI
Inspecting and Editing Knowledge Representations in Language Models
☆120 · Jul 24, 2023 · Updated 2 years ago
googleinterns / localizing-paragraph-memorization
☆15 · Feb 21, 2024 · Updated 2 years ago
Hunter-DDM / knowledge-neurons
Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"
☆177 · May 4, 2024 · Updated 2 years ago
mt-upc / transformer-contributions-nmt
☆18 · Oct 6, 2022 · Updated 3 years ago
warnikchow / prosem
Prosody-semantics Interface in Seoul Korean
☆12 · Oct 9, 2020 · Updated 5 years ago
carina-kauf / better-mlm-scoring
[Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring
☆12 · Dec 1, 2023 · Updated 2 years ago
jalammar / ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…
☆2,100 · Aug 15, 2024 · Updated last year
chrisliu298 / llm-unlearn-eco
[NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts
☆41 · Sep 26, 2024 · Updated last year
lukasgarbas / can-we-tune-together
Combining encoder-based language models
☆11 · Nov 11, 2021 · Updated 4 years ago
ydyjya / LLM-IHS-Explanation
☆60 · Jun 13, 2024 · Updated 2 years ago
EleutherAI / knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models.
☆160 · Feb 13, 2022 · Updated 4 years ago
mega002 / DocQN
Author implementation of "Learning to Search in Long Documents Using Document Structure" (Mor Geva and Jonathan Berant, 2018)
☆22 · Jul 12, 2018 · Updated 8 years ago
hichoe95 / Artifact-Detection-and-Sequential-Ablation
[IJCAI-2022] Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?
☆23 · Nov 19, 2024 · Updated last year
cdpierse / transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
☆1,416 · Aug 30, 2023 · Updated 2 years ago
davidberenstein1957 / fast-sentence-transformers
Simply, faster, sentence-transformers
☆144 · Aug 27, 2024 · Updated last year
ckkissane / crosscoder-model-diff-replication
Open source replication of Anthropic's Crosscoders for Model Diffing
☆68 · Oct 27, 2024 · Updated last year
tm4roon / data-augmentation-for-nlp
An implementation of data augmentation methods for natural language processing tasks.
☆13 · Jul 25, 2024 · Updated last year
saprmarks / feature-circuits
☆223 · Oct 14, 2025 · Updated 9 months ago
GEM-benchmark / NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
☆786 · May 19, 2024 · Updated 2 years ago
ethz-spylab / unlearning-vs-safety
☆27 · Oct 6, 2024 · Updated last year
aryamanarora / causalgym
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
☆54 · Nov 30, 2024 · Updated last year
yihuaihong / ConceptVectors
[EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"
☆40 · Aug 20, 2025 · Updated 11 months ago
mohsenfayyaz / GlobEnc
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
☆21 · May 16, 2023 · Updated 3 years ago
peterwestuw / surface-form-competition
☆59 · May 4, 2022 · Updated 4 years ago