KempnerInstitute / overcompleteLinks
π Overcomplete is a Vision-based SAE Toolbox
β118Updated 2 months ago
Alternatives and similar repositories for overcomplete
Users that are interested in overcomplete are comparing it to the libraries listed below
Sorting:
- LENS Projectβ52Updated last year
- π Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)β71Updated 2 years ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).β337Updated 6 months ago
- β16Updated 9 months ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Worksβ¦β19Updated last year
- β115Updated 11 months ago
- β143Updated last month
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformersβ42Updated 11 months ago
- Sparse and discrete interpretability tool for neural networksβ64Updated last year
- β58Updated last year
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]β219Updated 6 months ago
- A toolkit for quantitative evaluation of data attribution methods.β55Updated 6 months ago
- Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.β56Updated last year
- π Aligning Human & Machine Vision using explainabilityβ54Updated 2 years ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"β55Updated last year
- Attribution-based Parameter Decompositionβ33Updated 7 months ago
- Personal implementation of ASIF by Antonio Norelliβ26Updated last year
- β23Updated last year
- Sparse Autoencoder Training Libraryβ56Updated 9 months ago
- PyTorch library for Active Fine-Tuningβ96Updated 4 months ago
- Mechanistic understanding and validation of large AI models with SemanticLensβ50Updated 2 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models β¦β241Updated last week
- Universal Neurons in GPT2 Language Modelsβ30Updated last year
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in Laβ¦β24Updated 3 months ago
- Tools for optimizing steering vectors in LLMs.β19Updated 9 months ago
- Sparse Autoencoder for Mechanistic Interpretabilityβ290Updated last year
- β132Updated 2 years ago
- β83Updated 11 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffingβ63Updated last year
- πͺ Interpreto is an interpretability toolbox for LLMsβ139Updated last week