nrimsky / InfluenceFunctions
Implementation of Influence Function approximations for differently sized ML models, using PyTorch
☆15 · Updated last year
Alternatives and similar repositories for InfluenceFunctions:
Users who are interested in InfluenceFunctions are comparing it to the libraries listed below.
- In-context Example Selection with Influences ☆15 · Updated last year
- ☆36 · Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆37 · Updated 2 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs" ☆21 · Updated 5 months ago
- ☆31 · Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆43 · Updated last year
- ☆16 · Updated 7 months ago
- ☆24 · Updated 2 months ago
- ☆53 · Updated last year
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆18 · Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆26 · Updated 8 months ago
- ☆46 · Updated last year
- Few-shot Learning with Auxiliary Data ☆26 · Updated last year
- ☆60 · Updated 3 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms. ☆60 · Updated this week
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆29 · Updated 3 weeks ago
- ☆76 · Updated 6 months ago
- Redwood Research's transformer interpretability tools ☆14 · Updated 2 years ago
- A library for efficient patching and automatic circuit discovery. ☆53 · Updated 2 months ago
- Sparse Autoencoder Training Library ☆41 · Updated 3 months ago
- ☆38 · Updated 9 months ago
- ☆52 · Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆36 · Updated 3 months ago
- ☆28 · Updated last year
- Official Repository for Dataset Inference for LLMs ☆31 · Updated 6 months ago
- AI Logging for Interpretability and Explainability🔬 ☆102 · Updated 8 months ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature ☆132 · Updated 6 months ago