KempnerInstitute / overcompleteLinks
π Overcomplete is a Vision-based SAE Toolbox
β106Updated last week
Alternatives and similar repositories for overcomplete
Users that are interested in overcomplete are comparing it to the libraries listed below
Sorting:
- π Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)β71Updated 2 years ago
- LENS Projectβ51Updated last year
- β16Updated 7 months ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Worksβ¦β19Updated last year
- β38Updated 2 months ago
- A toolkit for quantitative evaluation of data attribution methods.β54Updated 4 months ago
- Sparse and discrete interpretability tool for neural networksβ64Updated last year
- Universal Neurons in GPT2 Language Modelsβ31Updated last year
- Official implementation of MAIA, A Multimodal Automated Interpretability Agentβ99Updated last month
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).β324Updated 4 months ago
- β53Updated 10 months ago
- β111Updated 10 months ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformersβ42Updated 9 months ago
- Decomposing and Editing Predictions by Modeling Model Computationβ139Updated last year
- Attribution-based Parameter Decompositionβ33Updated 6 months ago
- Mechanistic understanding and validation of large AI models with SemanticLensβ47Updated last week
- Sparse Autoencoder Training Libraryβ55Updated 7 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabsβ36Updated 3 years ago
- β27Updated 2 years ago
- β136Updated 3 weeks ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaineβ¦β40Updated 2 years ago
- PyTorch library for Active Fine-Tuningβ95Updated 2 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"β55Updated last year
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNetβ32Updated 2 years ago
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]β211Updated 5 months ago
- πͺ Interpreto is an interpretability toolbox for LLMsβ71Updated last week
- A fast, effective data attribution method for neural networks in PyTorchβ222Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models β¦β231Updated last week
- β81Updated last week
- β25Updated last year