multimodal-interpretability / maiaLinks

Official implementation of MAIA, A Multimodal Automated Interpretability Agent

☆83

Alternatives and similar repositories for maia

Users that are interested in maia are comparing it to the libraries listed below

Sorting:

stanfordnlp / axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆112Updated last month
EvolvingLMMs-Lab / multimodal-sae
[ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
☆146Updated 3 weeks ago
g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆29Updated 3 months ago
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆87Updated 5 months ago
JoshEngels / MultiDimensionalFeatures
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆77Updated 8 months ago
pliang279 / HEMM
Holistic evaluation of multimodal foundation models
☆48Updated 11 months ago
katiekang1998 / reasoning_generalization
☆34Updated 7 months ago
peterljq / Parsimonious-Concept-Engineering
PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)
☆39Updated 9 months ago
dmis-lab / Monet
[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
☆70Updated last month
apple / ml-rpm-bench
☆41Updated last year
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆91Updated 8 months ago
lucidrains / coconut-pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
☆178Updated last month
ericwtodd / function_vectors
Function Vectors in Large Language Models (ICLR 2024)
☆175Updated 3 months ago
MadryLab / modelcomponents
Decomposing and Editing Predictions by Modeling Model Computation
☆138Updated last year
stanfordmlgroup / ManyICL
☆142Updated last year
fangyuan-ksgk / CoT-Reasoning-without-Prompting
Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting
☆32Updated last year
AtsuMiyai / UPD
[ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models
☆77Updated 2 months ago
RobertCsordas / moeut
☆83Updated 11 months ago
naver-ai / model-stock
Model Stock: All we need is just a few fine-tuned models
☆119Updated 10 months ago
oripress / EntropyEnigma
Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"
☆53Updated last year
goombalab / phi-mamba
Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…
☆113Updated 10 months ago
declare-lab / trust-align
Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…
☆63Updated 5 months ago
lucidrains / CALM-pytorch
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind
☆177Updated 10 months ago
bethgelab / frequency_determines_performance
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…
☆90Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 11 months ago
prateeky2806 / ties-merging
☆185Updated last year
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆116Updated last year
architsharma97 / dpo-rlaif
☆99Updated last year
microsoft / x-reasoner
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆47Updated 3 months ago
BorealisAI / flora-opt
This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.
☆104Updated last year