rmovva / HypotheSAEs
Hypothesizing interpretable relationships in text datasets using sparse autoencoders.
☆24Updated this week
Alternatives and similar repositories for HypotheSAEs:
Users that are interested in HypotheSAEs are comparing it to the libraries listed below
- Parametric and non-parametric conditional independence testing.☆10Updated 4 years ago
- ☆23Updated 3 years ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆25Updated 6 months ago
- ☆11Updated 8 months ago
- Achieve error-rate fairness between societal groups for any score-based classifier.☆17Updated last year
- ☆29Updated last year
- AutoML Two-Sample Test☆19Updated 2 years ago
- Code for ICML 2021 paper "Regularizing towards Causal Invariance: Linear Models with Proxies" (ICML 2021)☆11Updated 3 years ago
- [Paper] Repository for the paper "On a Guided Nonnegative matrix factorization," published in IEEE ICASSP 2021.☆10Updated 2 years ago
- Statistical inference for fairness auditing☆13Updated last year
- Replication data and code for "Prestige drives epistemic inequality in the diffusion of scientific ideas"☆14Updated 6 years ago
- ☆34Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆15Updated last year
- Functional matrix factorization via Bayesian tensor filtering☆13Updated 2 years ago
- ❓y0 (pronounced "why not?") is for causal inference in Python☆51Updated last month
- BenchBench is a Python package to evaluate multi-task benchmarks.☆15Updated 9 months ago
- Minimal, standalone library for solving GLMs in PyTorch☆26Updated 3 years ago
- ☆26Updated 2 years ago
- ☆22Updated last year
- ☆15Updated 3 months ago
- This is the code for the paper Jacobian-based Causal Discovery with Nonlinear ICA, demonstrating how identifiable representations (partic…☆18Updated 8 months ago
- Visualize the proportion of results for a given PubMed search over time and compare searches to one another!☆12Updated 2 years ago
- A visual labeling system implemented in Jupyter widgets.☆11Updated last year
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated 2 months ago
- Contains public materials for students enrolled in MITx: 6.871x, Machine Learning for Healthcare☆20Updated 3 years ago
- ☆26Updated 2 years ago
- Matrix tools for building and inspecting latent spaces☆27Updated 6 years ago
- ☆37Updated 3 years ago
- Understanding how features learned by neural networks evolve throughout training☆34Updated 6 months ago
- A package for Safe Anytime Valid Inference☆26Updated 6 months ago