google-research / interpretability-theory
☆26Updated last year
Alternatives and similar repositories for interpretability-theory:
Users that are interested in interpretability-theory are comparing it to the libraries listed below
- Recycling diverse models☆44Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆18Updated last year
- Quantification of Uncertainty with Adversarial Models☆27Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆54Updated last year
- ☆14Updated last year
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation☆42Updated 4 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated 9 months ago
- ☆17Updated 2 years ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Updated 4 months ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆31Updated 3 years ago
- Minimum Description Length probing for neural network representations☆18Updated this week
- ☆36Updated 6 months ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆29Updated last year
- Research on Tabular Foundation Models☆38Updated last month
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆39Updated 9 months ago
- ☆18Updated 2 years ago
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆34Updated last year
- ☆12Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆94Updated last year
- Personal implementation of ASIF by Antonio Norelli☆25Updated 8 months ago
- Official implementation of the paper "Interventions, Where and How? Experimental Design for Causal Models at Scale", NeurIPS 2022.☆19Updated 2 years ago
- ☆18Updated 3 years ago
- ☆21Updated last year
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.☆24Updated last year
- ☆44Updated 2 years ago
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- ☆35Updated last year
- An Empirical Study of Invariant Risk Minimization☆27Updated 4 years ago
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆13Updated last year