KempnerInstitute / overcomplete
Overcomplete is a Vision-based SAE Toolbox
☆118 Updated 2 months ago
Alternatives and similar repositories for overcomplete
Users who are interested in overcomplete are comparing it to the libraries listed below.
- Code for: "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023) ☆71 Updated 2 years ago
- LENS Project ☆52 Updated last year
- ☆16 Updated 9 months ago
- ☆115 Updated 11 months ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs). ☆337 Updated 6 months ago
- ☆58 Updated last year
- Sparse Autoencoder Training Library ☆56 Updated 9 months ago
- ☆143 Updated last month
- A toolkit for quantitative evaluation of data attribution methods. ☆55 Updated 6 months ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Works… ☆19 Updated last year
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers ☆42 Updated 11 months ago
- Sparse and discrete interpretability tool for neural networks ☆64 Updated last year
- ☆88 Updated last month
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet ☆32 Updated 2 years ago
- Attribution-based Parameter Decomposition ☆33 Updated 7 months ago
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery" ☆45 Updated last year
- Personal implementation of ASIF by Antonio Norelli ☆26 Updated last year
- ☆23 Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆63 Updated last year
- ☆132 Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" ☆45 Updated last year
- Universal Neurons in GPT2 Language Models ☆30 Updated last year
- ☆28 Updated 2 years ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆55 Updated last year
- ☆25 Updated 9 months ago
- Sparse Autoencoder for Mechanistic Interpretability ☆290 Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆241 Updated last week
- A fast, effective data attribution method for neural networks in PyTorch ☆229 Updated last year
- ☆388 Updated 5 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆238 Updated last year