nrimsky / InfluenceFunctionsLinks

Implementation of Influence Function approximations for differently sized ML models, using PyTorch

☆15

Alternatives and similar repositories for InfluenceFunctions

Users that are interested in InfluenceFunctions are comparing it to the libraries listed below

Sorting:

ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Updated 2 years ago
koayon / atp_star
PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)
☆20Updated 9 months ago
ethz-spylab / superhuman-ai-consistency
☆31Updated 2 years ago
ckkissane / sae-transfer
Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"
☆12Updated last year
google-research / jax-influence
☆62Updated 3 years ago
KoyenaPal / future-lens
Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State
☆20Updated last week
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆47Updated 2 years ago
formll / resolving-scaling-law-discrepancies
☆20Updated last year
guy-dar / embedding-space
☆55Updated 2 years ago
mcleish7 / gemstone-scaling-laws
Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)
☆29Updated last month
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆64Updated last year
LoryPack / LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆71Updated last year
pietrolesci / memorisation-profiles
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆23Updated 7 months ago
jmerullo / lm_vector_arithmetic
☆36Updated 2 years ago
shauli-ravfogel / rlace-icml
☆36Updated 3 years ago
TristanThrush / perplexity-correlations
Simple and scalable tools for data-driven pretraining data selection.
☆28Updated 4 months ago
GSYfate / knnlm-limits
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆24Updated 6 months ago
Nix07 / finetuning
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…
☆28Updated this week
MadryLab / datamodels-data
Data for "Datamodels: Predicting Predictions with Training Data"
☆97Updated 2 years ago
jiahai-feng / binding-iclr
☆16Updated last year
adamkarvonen / SAE_BoardGameEval
☆23Updated 9 months ago
msakarvadia / AttentionLens
Interpretating the latent space representations of attention head outputs for LLMs
☆34Updated last year
ApolloResearch / e2e_sae
Sparse Autoencoder Training Library
☆55Updated 6 months ago
sylinrl / CalibratedMath
Teaching Models to Express Their Uncertainty in Words
☆39Updated 3 years ago
EleutherAI / elk-generalization
Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…
☆28Updated last year
srush / LLM-Talk
☆52Updated last year
bilal-chughtai / rep-theory-mech-interp
☆27Updated 2 years ago
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆27Updated 10 months ago
tml-epfl / icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆31Updated 9 months ago
JeanKaddour / NoTrainNoGain
Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)
☆80Updated 2 years ago