KempnerInstitute / overcompleteLinks
π Overcomplete is a Vision-based SAE Toolbox
β101Updated 2 weeks ago
Alternatives and similar repositories for overcomplete
Users that are interested in overcomplete are comparing it to the libraries listed below
Sorting:
- π Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)β70Updated 2 years ago
- LENS Projectβ51Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"β55Updated last year
- β16Updated 6 months ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).β323Updated 3 months ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Worksβ¦β19Updated last year
- β110Updated 9 months ago
- A toolkit for quantitative evaluation of data attribution methods.β53Updated 4 months ago
- Sparse and discrete interpretability tool for neural networksβ64Updated last year
- β27Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaineβ¦β40Updated 2 years ago
- β53Updated 10 months ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022β64Updated 2 years ago
- Sparse Autoencoder Training Libraryβ55Updated 6 months ago
- Model Zoos published at the NeurIPS 2022 Dataset & Benchmark track: "Model Zoos: A Dataset of Diverse Populations of Neural Network Modelβ¦β56Updated last month
- Decomposing and Editing Predictions by Modeling Model Computationβ138Updated last year
- Personal implementation of ASIF by Antonio Norelliβ26Updated last year
- Universal Neurons in GPT2 Language Modelsβ31Updated last year
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformersβ41Updated 9 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" πβ45Updated last year
- A fast, effective data attribution method for neural networks in PyTorchβ220Updated last year
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNetβ32Updated 2 years ago
- β37Updated last month
- Deep Networks Grok All the Time and Here is Whyβ37Updated last year
- Uncertainty-aware representation learning (URL) benchmarkβ105Updated 8 months ago
- Visualizing representations with diffusion based conditional generative model.β102Updated 2 years ago
- β24Updated 11 months ago
- Codebase for Mechanistic Mode Connectivityβ14Updated 2 years ago
- Omnigrok: Grokking Beyond Algorithmic Dataβ62Updated 2 years ago
- β136Updated this week