Interpreto is an interpretability toolbox for LLMs
☆160 · Mar 18, 2026 · Updated this week
Alternatives and similar repositories for interpreto
Users that are interested in interpreto are comparing it to the libraries listed below
- Simple, compact, and hackable post-hoc deep OOD detection for already trained tensorflow or pytorch image classifiers. ☆60 · Feb 17, 2026 · Updated last month
- Influenciae is a Tensorflow Toolbox for Influence Functions ☆66 · Updated this week
- ☆14 · May 6, 2025 · Updated 10 months ago
- ICLR 2024: Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations ☆22 · May 1, 2025 · Updated 10 months ago
- Build and train Lipschitz constrained networks: TensorFlow implementation of k-Lipschitz layers ☆102 · Mar 14, 2025 · Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing… ☆10 · Oct 7, 2024 · Updated last year
- ☆15 · Jan 2, 2023 · Updated 3 years ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆31 · Apr 22, 2025 · Updated 11 months ago
- A library for training crosscoders ☆16 · May 28, 2025 · Updated 9 months ago
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery" ☆47 · May 31, 2024 · Updated last year
- DiffuLab is designed to provide a simple and flexible way to train diffusion models while allowing full customization of its core compone… ☆43 · Jan 11, 2026 · Updated 2 months ago
- Interpretability for Leela Chess Zero networks. ☆19 · Updated this week
- Puncc is a python library for predictive uncertainty quantification using conformal prediction. ☆377 · Updated this week
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/ ☆26 · Mar 10, 2025 · Updated last year
- [CVPRW 2024] Conformal prediction for uncertainty quantification in image segmentation ☆26 · Dec 9, 2024 · Updated last year
- LENS Project ☆52 · Feb 22, 2024 · Updated 2 years ago
- Hash-routed Networks ☆20 · Nov 20, 2020 · Updated 5 years ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch ☆10 · Aug 7, 2024 · Updated last year
- ☆17 · Jul 9, 2025 · Updated 8 months ago
- [MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data ☆17 · Apr 3, 2025 · Updated 11 months ago
- Unified access to Large Language Model modules using NNsight ☆103 · Feb 28, 2026 · Updated 3 weeks ago
- ☆18 · Apr 7, 2025 · Updated 11 months ago
- https://footprints.baulab.info ☆18 · Oct 4, 2024 · Updated last year
- Tools for optimizing steering vectors in LLMs. ☆20 · Apr 10, 2025 · Updated 11 months ago
- ☆14 · Jun 11, 2025 · Updated 9 months ago
- A lightweight didactic library of kernel methods using the back-end JAX. ☆12 · Mar 8, 2023 · Updated 3 years ago
- ☆87 · Updated this week
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024] ☆227 · Jul 11, 2025 · Updated 8 months ago
- FastClassification is a tensorflow toolbox for class classification. It provides a training module with various backbones and training tr… ☆15 · Jun 26, 2021 · Updated 4 years ago
- Tidy up your machine learning experiments ☆17 · Sep 5, 2019 · Updated 6 years ago
- Agent-OM: Leveraging LLM Agents for Ontology Matching ☆19 · Jan 24, 2026 · Updated last month
- Residual Quantization Autoencoder, used for interpreting LLMs ☆14 · Jan 1, 2025 · Updated last year
- Comprehensive Python Plotly tutorial & cheat sheet. Covers plotly.express, graph_objects & figure_factory for Data Science, 3D plotting, … ☆22 · Dec 3, 2025 · Updated 3 months ago
- BM-MAE: Multimodal Masked Autoencoder Pre-training for 3D MRI-based Brain Tumor Analysis with Missing Modalities ☆29 · Aug 24, 2025 · Updated 6 months ago
- A tiny easily hackable implementation of a feature dashboard. ☆16 · Oct 21, 2025 · Updated 5 months ago
- Code to enable layer-level steering in LLMs using sparse auto encoders ☆31 · Sep 18, 2025 · Updated 6 months ago
- ☆18 · Mar 13, 2026 · Updated last week
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆245 · Updated this week
- Build and train Lipschitz-constrained networks: PyTorch implementation of 1-Lipschitz layers. For TensorFlow/Keras implementation, see ht… ☆41 · Updated this week