MadryLab / modelcomponentsLinks

Decomposing and Editing Predictions by Modeling Model Computation

☆138

Alternatives and similar repositories for modelcomponents

Users that are interested in modelcomponents are comparing it to the libraries listed below

Sorting:

multimodal-interpretability / maia
Official implementation of MAIA, A Multimodal Automated Interpretability Agent
☆83Updated last month
EvolvingLMMs-Lab / multimodal-sae
[ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
☆146Updated 3 weeks ago
katiekang1998 / reasoning_generalization
☆34Updated 6 months ago
g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆29Updated 3 months ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆173Updated last year
goombalab / phi-mamba
Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…
☆113Updated 10 months ago
oripress / EntropyEnigma
Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"
☆53Updated last year
ycjing / Awesome-Model-Merging
A curated list of Model Merging methods.
☆92Updated 10 months ago
prateeky2806 / ties-merging
☆185Updated last year
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
JoshEngels / MultiDimensionalFeatures
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆77Updated 8 months ago
stanfordmlgroup / ManyICL
☆142Updated last year
stanfordnlp / axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆112Updated last month
SakanaAI / Sudoku-Bench
An AI benchmark for creative, human-like problem solving using Sudoku variants
☆84Updated last week
uclaml / MoE
Towards Understanding the Mixture-of-Experts Layer in Deep Learning
☆31Updated last year
pliang279 / HEMM
Holistic evaluation of multimodal foundation models
☆48Updated 11 months ago
lucidrains / coconut-pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
☆178Updated last month
neilwen987 / CSR_Adaptive_Rep
Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation
☆116Updated last month
abdelwahed / OT_for_big_data
Optimal Transport in the Big Data Era
☆107Updated 9 months ago
HKUNLP / diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆172Updated 5 months ago
llm-merging / LLM-Merging
LLM-Merging: Building LLMs Efficiently through Merging
☆202Updated 10 months ago
csinva / interpretable-embeddings
Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)
☆39Updated 8 months ago
jxiw / MambaInLlama
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
☆225Updated 3 months ago
Leiay / looped_transformer
☆30Updated last year
gstoica27 / KnOTS
Model Merging with SVD to Tie the KnOTS [ICLR 2025]
☆62Updated 4 months ago
RobertCsordas / moeut
☆83Updated 11 months ago
koayon / awesome-adaptive-computation
A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).
☆149Updated 7 months ago
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆87Updated 5 months ago
fangyuan-ksgk / selective-attention-transformer
Unofficial Implementation of Selective Attention Transformer
☆17Updated 9 months ago
Luckfort / CD
[COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
☆79Updated 6 months ago