VectorInstitute / vectorlm
LLM finetuning in resource-constrained environments.
☆45 Updated 7 months ago
Alternatives and similar repositories for vectorlm:
Users who are interested in vectorlm are comparing it to the libraries listed below:
- Efficient LLM inference on Slurm clusters using vLLM. ☆46 Updated this week
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature ☆132 Updated 6 months ago
- AI Logging for Interpretability and Explainability 🔬 ☆102 Updated 8 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆62 Updated 4 months ago
- nanoGPT-like codebase for LLM training ☆89 Updated this week
- ☆63 Updated 2 years ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers ☆39 Updated this week
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms. ☆60 Updated this week
- ☆86 Updated this week
- ☆154 Updated 2 months ago
- A fast, effective data attribution method for neural networks in PyTorch ☆192 Updated 2 months ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici… ☆103 Updated last year
- ☆189 Updated 11 months ago
- ☆36 Updated last year
- ☆116 Updated last year
- ☆28 Updated 7 months ago
- ☆203 Updated 4 months ago
- A library for efficient patching and automatic circuit discovery. ☆53 Updated 2 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆70 Updated 3 months ago
- ☆109 Updated 6 months ago
- ☆13 Updated 11 months ago
- ☆34 Updated 10 months ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models" ☆82 Updated this week
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆62 Updated 3 months ago
- Function Vectors in Large Language Models (ICLR 2024) ☆137 Updated 4 months ago
- Datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆71 Updated last year
- ☆89 Updated last year
- ☆17 Updated last month
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ☆71 Updated last month
- ☆40 Updated last month