dmbeaglehole / neural_controllers
Code for steering and monitoring with concept vectors in LLMs. https://arxiv.org/abs/2502.03708
☆19 Updated 6 months ago
Alternatives and similar repositories for neural_controllers
Users who are interested in neural_controllers are comparing it to the libraries listed below.
- A fast, effective data attribution method for neural networks in PyTorch ☆229 Updated last year
- ☆80 Updated 3 years ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆110 Updated 2 years ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature ☆178 Updated 7 months ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy ☆86 Updated last year
- Model Zoos published at the NeurIPS 2022 Dataset & Benchmark track: "Model Zoos: A Dataset of Diverse Populations of Neural Network Model… ☆58 Updated 4 months ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms. ☆108 Updated this week
- ☆116 Updated last year
- A simple PyTorch implementation of influence functions. ☆92 Updated last year
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models" ☆25 Updated 2 years ago
- ☆32 Updated last year
- ☆13 Updated 2 years ago
- ☆34 Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data" ☆97 Updated 2 years ago
- ☆24 Updated last year
- ☆34 Updated 2 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023). ☆52 Updated last year
- Efficient empirical NTKs in PyTorch ☆22 Updated 3 years ago
- ☆37 Updated last year
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratc… ☆155 Updated 3 years ago
- ☆63 Updated 4 years ago
- [ICLR 2025] General-purpose activation steering library ☆141 Updated 4 months ago
- ☆247 Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs. ☆57 Updated 3 months ago
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated 2 years ago
- ☆39 Updated 3 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆79 Updated last year
- Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression" ☆25 Updated 2 years ago
- ☆146 Updated last month
- Code for the ICLR 2022 paper "Salient ImageNet: How to discover spurious features in deep learning?" ☆41 Updated 3 years ago