VectorInstitute / vectorlm
LLM finetuning in resource-constrained environments.
☆43Updated 7 months ago
Alternatives and similar repositories for vectorlm:
Users that are interested in vectorlm are comparing it to the libraries listed below
- Efficient LLM inference on Slurm clusters using vLLM.☆45Updated this week
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆127Updated 5 months ago
- nanoGPT-like codebase for LLM training☆85Updated this week
- ☆116Updated last year
- ☆202Updated 3 months ago
- ☆83Updated last year
- ☆62Updated 2 years ago
- ☆45Updated this week
- ☆139Updated this week
- AI Logging for Interpretability and Explainability🔬☆100Updated 7 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆177Updated last month
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆101Updated last year
- ☆54Updated 2 months ago
- ☆12Updated 10 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆92Updated last year
- Using sparse coding to find distributed representations used by neural networks.☆210Updated last year
- A library for efficient patching and automatic circuit discovery.☆48Updated 2 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆61Updated 3 months ago
- ☆17Updated last month
- ☆30Updated 2 months ago
- ☆28Updated 6 months ago
- ☆78Updated 10 months ago
- Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"☆79Updated last year
- ☆169Updated last year
- This repository collects all relevant resources about interpretability in LLMs☆309Updated 2 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆67Updated 3 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆81Updated 2 months ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆39Updated 5 months ago
- ☆34Updated 11 months ago
- Mechanistic Interpretability Visualizations using React☆223Updated last month