A library for finding knowledge neurons in pretrained transformer models.
☆159 · Feb 13, 2022 · Updated 4 years ago
Alternatives and similar repositories for knowledge-neurons
Users who are interested in knowledge-neurons compare it to the libraries listed below.
- Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers" ☆173 · May 4, 2024 · Updated last year
- Complexity Based Prompting for Multi-Step Reasoning ☆17 · Mar 10, 2023 · Updated 2 years ago
- ☆68 · May 18, 2023 · Updated 2 years ago
- ☆20 · Aug 19, 2021 · Updated 4 years ago
- Code for Editing Factual Knowledge in Language Models ☆142 · Jan 28, 2022 · Updated 4 years ago
- ☆65 · Nov 4, 2021 · Updated 4 years ago
- Mechanistic Interpretability for Transformer Models ☆53 · Jun 1, 2022 · Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆26 · Jan 10, 2022 · Updated 4 years ago
- Redwood Research's transformer interpretability tools ☆15 · Apr 15, 2022 · Updated 3 years ago
- ☆16 · Apr 11, 2022 · Updated 3 years ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca… ☆61 · May 9, 2023 · Updated 2 years ago
- ☆57 · Jun 15, 2023 · Updated 2 years ago
- Locating and editing factual associations in GPT (NeurIPS 2022) ☆730 · Apr 20, 2024 · Updated last year
- ☆30 · Nov 25, 2021 · Updated 4 years ago
- ☆48 · Jan 21, 2024 · Updated 2 years ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models" ☆14 · May 31, 2023 · Updated 2 years ago
- ☆18 · May 21, 2018 · Updated 7 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆48 · Nov 30, 2021 · Updated 4 years ago
- Knowledge Infused Decoding ☆71 · Dec 31, 2023 · Updated 2 years ago
- ☆21 · Mar 15, 2023 · Updated 2 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p… ☆13 · Jan 26, 2025 · Updated last year
- ☆89 · Oct 8, 2022 · Updated 3 years ago
- ☆78 · Dec 7, 2023 · Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation ☆24 · May 1, 2022 · Updated 3 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables ☆24 · Aug 22, 2022 · Updated 3 years ago
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization… ☆16 · Feb 15, 2024 · Updated 2 years ago
- One stop shop for all things carp ☆59 · Sep 9, 2022 · Updated 3 years ago
- ☆14 · Apr 27, 2022 · Updated 3 years ago
- ☆46 · Apr 13, 2022 · Updated 3 years ago
- ☆27 · Mar 13, 2021 · Updated 4 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa. ☆60 · Mar 31, 2022 · Updated 3 years ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023) ☆541 · Jan 31, 2024 · Updated 2 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage" ☆17 · Nov 9, 2021 · Updated 4 years ago
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022). ☆44 · Dec 25, 2022 · Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 · May 10, 2022 · Updated 3 years ago
- MEND: Fast Model Editing at Scale ☆257 · Aug 30, 2023 · Updated 2 years ago
- ☆290 · Dec 2, 2022 · Updated 3 years ago
- ☆187 · Jul 2, 2025 · Updated 7 months ago
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts" ☆51 · Jul 17, 2022 · Updated 3 years ago