fjzzq2002 / pizza
Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"
☆13 · Updated last year
Alternatives and similar repositories for pizza:
Users that are interested in pizza are comparing it to the libraries listed below
- ☆24 · Updated 2 years ago
- ☆26 · Updated last year
- Sparse Autoencoder Training Library ☆42 · Updated 4 months ago
- ☆44 · Updated last year
- ☆28 · Updated 4 months ago
- ☆21 · Updated last month
- ☆12 · Updated 11 months ago
- ☆61 · Updated 2 years ago
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We… ☆46 · Updated last year
- ZeroC is a neuro-symbolic method that trained with elementary visual concepts and relations, can zero-shot recognize and acquire more com… ☆30 · Updated last year
- Deep Learning & Information Bottleneck ☆57 · Updated last year
- ☆23 · Updated 5 months ago
- Omnigrok: Grokking Beyond Algorithmic Data ☆53 · Updated 2 years ago
- ☆80 · Updated 11 months ago
- A library for efficient patching and automatic circuit discovery. ☆54 · Updated 2 weeks ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity… ☆23 · Updated 11 months ago
- ☆11 · Updated 6 months ago
- ☆88 · Updated 2 weeks ago
- ☆24 · Updated last week
- The Energy Transformer block, in JAX ☆56 · Updated last year
- ☆9 · Updated 2 years ago
- ☆13 · Updated 2 years ago
- Interpreting the latent space representations of attention head outputs for LLMs ☆30 · Updated 6 months ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce… ☆19 · Updated 10 months ago
- ☆35 · Updated 11 months ago
- This repository contains the official code for Energy Transformer---an efficient Energy-based Transformer variant for graph classificatio… ☆23 · Updated last year
- This is the code repository associated with the paper "Abstractors and relational cross-attention: An inductive bias for explicit relati… ☆16 · Updated 6 months ago
- Xmixers: A collection of SOTA efficient token/channel mixers ☆11 · Updated 3 months ago
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper. ☆35 · Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆38 · Updated last year