google-research / interpretability-theory

☆26

Alternatives and similar repositories for interpretability-theory

Users who are interested in interpretability-theory compare it to the libraries listed below.

facebookresearch / ModelRatatouille
Recycling diverse models
☆45 Updated 2 years ago
AndyShih12 / LongHorizonTemperatureScaling
PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023
☆20 Updated 2 years ago
microsoft / augmented-interpretable-models
Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.
☆42 Updated 4 months ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18 Updated 5 months ago
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19 Updated 2 years ago
ekinakyurek / google-research
Google Research
☆46 Updated 2 years ago
yannadani / cbed
Official implementation of the paper "Interventions, Where and How? Experimental Design for Causal Models at Scale", NeurIPS 2022.
☆20 Updated 2 years ago
BatsResearch / nplm
A weak supervision framework for (partial) labeling functions
☆16 Updated last year
YilunZhou / optimal-active-learning
Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"
☆15 Updated 4 years ago
jxbz / entropix
📰 Computing the information content of trained neural networks
☆21 Updated 3 years ago
AndreasMadsen / nlp-roar-interpretability
Measuring if attention is explanation with ROAR
☆22 Updated 2 years ago
JonasGeiping / dataaugs
☆18 Updated 2 years ago
facebookresearch / grounding-inductive-biases
Reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"
☆17 Updated 9 months ago
google-deepmind / ssl_hsic
☆37 Updated 11 months ago
fgranese / DOCTOR
Advances in Neural Information Processing Systems (NeurIPS 2021)
☆22 Updated 2 years ago
alexrame / diwa
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31 Updated 2 years ago
mfederici / dsit
Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"
☆25 Updated 3 years ago
sjunhongshen / DASH
☆23 Updated 2 years ago
sayakpaul / parameter-ensemble-differential-evolution
Shows how to do parameter ensembling using differential evolution.
☆10 Updated 3 years ago
benbo / interactive-weak-supervision
Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling
☆31 Updated 4 years ago
SuReLI / NeurOps
Implementations of growing and pruning in neural networks
☆22 Updated last year
HazyResearch / embroid
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11 Updated last year
fsschneider / cockpit-experiments
Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks"
☆13 Updated 3 years ago
anndvision / quince
Code for Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding
☆22 Updated 2 years ago
jmtomczak / git_flow
General Invertible Transformations for Flow-based Generative Models
☆18 Updated 4 years ago
rmwu / sea-reproduce
Sample, estimate, aggregate: A recipe for causal discovery foundation models
☆11 Updated last year
HazyResearch / model-patching
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation
☆42 Updated 4 years ago
JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆30 Updated 2 years ago
google-research / jax-influence
☆60 Updated 3 years ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38 Updated 2 years ago