rmovva / HypotheSAEs
Hypothesizing interpretable relationships in text datasets using sparse autoencoders.
☆29 Updated this week
Alternatives and similar repositories for HypotheSAEs
Users who are interested in HypotheSAEs are comparing it to the libraries listed below.
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration. ☆13 Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch ☆15 Updated last year
- ☆35 Updated 2 years ago
- Discovering Data-driven Hypotheses in the Wild ☆85 Updated 6 months ago
- Implementation of the BatchTopK activation function for training sparse autoencoders (SAEs) ☆41 Updated 3 weeks ago
- Simple and scalable tools for data-driven pretraining data selection. ☆24 Updated 3 months ago
- Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data! ☆22 Updated last month
- ☆48 Updated last week
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22) ☆13 Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.