google-research / interpretability-theory
☆26 Updated 2 years ago
Alternatives and similar repositories for interpretability-theory
Users who are interested in interpretability-theory are comparing it to the libraries listed below.
- Recycling diverse models ☆44 Updated 2 years ago
- ☆18 Updated 2 years ago
- Official code for the paper: "Metadata Archaeology" ☆19 Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms ☆57 Updated last year
- Library implementing state-of-the-art Concept-based and Disentanglement Learning methods for Explainable AI ☆55 Updated 2 years ago
- Official implementation of the paper "Interventions, Where and How? Experimental Design for Causal Models at Scale", NeurIPS 2022. ☆20 Updated 2 years ago
- BenchBench is a Python package to evaluate multi-task benchmarks. ☆15 Updated 11 months ago
- 📰 Computing the information content of trained neural networks ☆21 Updated 3 years ago
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation ☆42 Updated 4 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆20 Updated 2 years ago
- Latest Weight Averaging (NeurIPS HITY 2022) ☆30 Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆31 Updated 2 years ago
- ☆12 Updated 2 years ago
- ☆18 Updated 3 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling ☆31 Updated 4 years ago
- Minimum Description Length probing for neural network representations ☆18 Updated 4 months ago
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic… ☆14 Updated last year
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift ☆30 Updated 4 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms" ☆15 Updated 4 years ago
- Code accompanying paper: Meta-Learning to Improve Pre-Training ☆37 Updated 3 years ago
- Advances in Neural Information Processing Systems (NeurIPS 2021) ☆22 Updated 2 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 Updated last year
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning ☆36 Updated 2 years ago
- Personal implementation of ASIF by Antonio Norelli ☆25 Updated last year
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral) ☆41 Updated 2 years ago
- Google Research ☆46 Updated 2 years ago
- Active and Sample-Efficient Model Evaluation ☆24 Updated last month
- Minimal, standalone library for solving GLMs in PyTorch ☆26 Updated 3 years ago
- Quantification of Uncertainty with Adversarial Models ☆29 Updated last year
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021) ☆16 Updated 2 years ago