google-research / interpretability-theoryLinks
☆27Updated 2 years ago
Alternatives and similar repositories for interpretability-theory
Users that are interested in interpretability-theory are comparing it to the libraries listed below
Sorting:
- ☆39Updated last year
- Recycling diverse models☆46Updated 2 years ago
- BenchBench is a Python package to evaluate multi-task benchmarks.☆18Updated 2 months ago
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- Implementations of growing and pruning in neural networks☆22Updated 2 years ago
- A weak supervision framework for (partial) labeling functions☆16Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated last month
- Updated code base for GlanceNets: Interpretable, Leak-proof Concept-based models☆25Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆58Updated 2 years ago
- Personal implementation of ASIF by Antonio Norelli☆26Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated 2 years ago
- Learning to Split for Automatic Bias Detection☆47Updated 2 years ago
- Code accompanying paper: Meta-Learning to Improve Pre-Training☆37Updated 4 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"☆15Updated 4 years ago
- Quantification of Uncertainty with Adversarial Models☆29Updated 2 years ago
- Measuring if attention is explanation with ROAR☆22Updated 2 years ago
- Google Research☆46Updated 3 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Updated 3 years ago
- Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI☆55Updated 3 years ago
- Training and evaluating NBM and SPAM for interpretable machine learning.☆78Updated 2 years ago
- This repository contains a Jax implementation of conformal training corresponding to the ICLR'22 paper "learning optimal conformal classi…☆130Updated 3 years ago
- Minimum Description Length probing for neural network representations☆20Updated 11 months ago
- Understanding how features learned by neural networks evolve throughout training☆41Updated last year
- ☆18Updated 5 months ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆43Updated 3 years ago
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation☆42Updated 5 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆40Updated 2 years ago
- ☆111Updated 3 years ago
- This repository contains the code of the distribution shift framework presented in A Fine-Grained Analysis on Distribution Shift (Wiles e…☆84Updated last month