google-research / interpretability-theory
☆26 Updated last year
Related projects
Alternatives and complementary repositories for interpretability-theory
- ☆17 Updated 2 years ago
- Recycling diverse models ☆44 Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms ☆53 Updated last year
- Official code for the paper: "Metadata Archaeology" ☆18 Updated last year
- ☆21 Updated last year
- ☆41 Updated last year
- Quantification of Uncertainty with Adversarial Models ☆27 Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆19 Updated last year
- ☆14 Updated 11 months ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆28 Updated last year
- Deep Learning & Information Bottleneck ☆50 Updated last year
- Improving Transformation Invariance in Contrastive Representation Learning ☆13 Updated 3 years ago
- ☆18 Updated 3 years ago
- Code for paper "Can contrastive learning avoid shortcut solutions?" NeurIPS 2021. ☆47 Updated 2 years ago
- ☆35 Updated 3 months ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 Updated 7 months ago
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023. ☆23 Updated last year
- Updated code base for GlanceNets: Interpretable, Leak-proof Concept-based models ☆25 Updated last year
- Code accompanying paper: Meta-Learning to Improve Pre-Training ☆37 Updated 3 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral) ☆40 Updated 2 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- ☆22 Updated last year
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 Updated 3 years ago
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation ☆42 Updated 4 years ago
- ☆36 Updated 2 years ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning ☆37 Updated last year
- Minimum Description Length probing for neural network representations ☆16 Updated last week
- A weak supervision framework for (partial) labeling functions ☆14 Updated 4 months ago
- Advances in Neural Information Processing Systems (NeurIPS 2021) ☆22 Updated 2 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks. ☆40 Updated last year