MadryLab / modelcomponentsLinks
Decomposing and Editing Predictions by Modeling Model Computation
☆138Updated last year
Alternatives and similar repositories for modelcomponents
Users that are interested in modelcomponents are comparing it to the libraries listed below
Sorting:
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆92Updated 3 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…☆116Updated last year
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 5 months ago
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆155Updated last week
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆227Updated last year
- Optimal Transport in the Big Data Era☆110Updated 11 months ago
- Holistic evaluation of multimodal foundation models☆48Updated last year
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆180Updated 3 months ago
- ☆33Updated 9 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆31Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models☆230Updated 5 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆125Updated 3 months ago
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆106Updated last week
- ☆50Updated 8 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆183Updated last year
- ☆142Updated last year
- A curated list of Model Merging methods.☆92Updated last year
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆81Updated 10 months ago
- ☆85Updated last year
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆39Updated 6 months ago
- ☆191Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆136Updated 3 months ago
- Awesome list of papers that extend Mamba to various applications.☆138Updated 3 months ago
- Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation☆125Updated 3 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆30Updated 11 months ago
- PyTorch library for Active Fine-Tuning☆93Updated last week
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆53Updated 4 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆101Updated last month