FOR-sight-ai / interpretoLinks
πͺ Interpreto is an interpretability toolbox for LLMs
β34Updated this week
Alternatives and similar repositories for interpreto
Users that are interested in interpreto are comparing it to the libraries listed below
Sorting:
- π Influenciae is a Tensorflow Toolbox for Influence Functionsβ64Updated last year
- π Overcomplete is a Vision-based SAE Toolboxβ77Updated last month
- β14Updated 3 months ago
- Build and train Lipschitz constrained networks: TensorFlow implementation of k-Lipschitz layersβ98Updated 5 months ago
- Simple, compact, and hackable post-hoc deep OOD detection for already trained tensorflow or pytorch image classifiers.β59Updated last month
- Attribution-based Parameter Decompositionβ29Updated 2 months ago
- β37Updated last week
- New implementations of old orthogonal layers unlock large scale training.β20Updated 2 months ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPSβ34Updated 2 years ago
- Engine for collecting, uploading, and downloading model activationsβ22Updated 4 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)β93Updated 5 months ago
- Build and train Lipschitz-constrained networks: PyTorch implementation of 1-Lipschitz layers. For TensorFlow/Keras implementation, see htβ¦β34Updated 3 weeks ago
- β81Updated 6 months ago
- Repository for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits, accepted at CVPR 2024 XAI4CV Worksβ¦β19Updated last year
- β52Updated last year
- β15Updated 4 months ago
- β15Updated last month
- A collection of meta-learning algorithms in Jaxβ23Updated 2 years ago
- β20Updated 7 months ago
- Sparse Autoencoder Training Libraryβ54Updated 4 months ago
- Interpreting how transformers simulate agents performing RL tasksβ87Updated last year
- β19Updated 2 years ago
- π CODS - Conformal Object Detection and Segmentationβ16Updated last month
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).β82Updated 3 weeks ago
- A toolkit for quantitative evaluation of data attribution methods.β53Updated last month
- Unified access to Large Language Model modules using NNsightβ44Updated last week
- Modified to support crosscoder training.β22Updated last month
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ246Updated 3 weeks ago
- Comparison between GFlowNets & Maximum Entropy RLβ19Updated last year
- DiffuLab is designed to provide a simple and flexible way to train diffusion models while allowing full customization of its core componeβ¦β30Updated this week