MadryLab / modelcomponentsLinks
Decomposing and Editing Predictions by Modeling Model Computation
☆138Updated 11 months ago
Alternatives and similar repositories for modelcomponents
Users that are interested in modelcomponents are comparing it to the libraries listed below
Sorting:
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆81Updated 3 months ago
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆222Updated last year
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)☆36Updated 6 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…☆108Updated 8 months ago
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models☆221Updated last month
- A curated list of Model Merging methods.☆92Updated 8 months ago
- Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆67Updated 10 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆160Updated 3 months ago
- ☆179Updated last year
- Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆132Updated 4 months ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆78Updated 4 months ago
- Collection of Reverse Engineering in Large Model☆32Updated 4 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆75Updated 6 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆162Updated last year
- Using sparse coding to find distributed representations used by neural networks.☆247Updated last year
- ☆93Updated 3 months ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆193Updated 2 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆114Updated last year
- State Space Models☆67Updated last year
- ☆124Updated 6 months ago
- tinybig for deep function learning☆60Updated 5 months ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models".☆38Updated 7 months ago
- One-shot Entropy Minimization☆119Updated this week
- Universal Neurons in GPT2 Language Models☆29Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆46Updated last week
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆91Updated 3 weeks ago
- ☆60Updated 4 months ago
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆77Updated last week
- A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).☆147Updated 5 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆197Updated 8 months ago