dmbeaglehole / neural_controllersLinks
Code for steering and monitoring with concepts vectors in LLMs. https://arxiv.org/abs/2502.03708
☆19Updated 6 months ago
Alternatives and similar repositories for neural_controllers
Users that are interested in neural_controllers are comparing it to the libraries listed below
Sorting:
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆178Updated 7 months ago
- ☆116Updated last year
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆108Updated this week
- Model Zoos published at the NeurIPS 2022 Dataset & Benchmark track: "Model Zoos: A Dataset of Diverse Populations of Neural Network Model…☆58Updated 4 months ago
- A fast, effective data attribution method for neural networks in PyTorch☆229Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆110Updated 2 years ago
- ☆80Updated 3 years ago
- Efficient empirical NTKs in PyTorch☆22Updated 3 years ago
- ☆32Updated last year
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆25Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- ☆58Updated last year
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆52Updated last year
- Code for the ICLR 2022 paper. Salient Imagenet: How to discover spurious features in deep learning?☆41Updated 3 years ago
- ☆104Updated 2 years ago
- AI Logging for Interpretability and Explainability🔬☆140Updated last year
- A simple PyTorch implementation of influence functions.☆92Updated last year
- A library for efficient patching and automatic circuit discovery.☆88Updated last month
- ☆63Updated 4 years ago
- ☆23Updated 5 months ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆21Updated 2 years ago
- ☆13Updated 2 years ago
- ☆146Updated last month
- ☆51Updated 2 years ago
- ☆206Updated 3 months ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆42Updated last year
- ☆132Updated 2 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆79Updated last year
- ☆37Updated last year
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆47Updated last year