VectorInstitute / vectorlmLinks
LLM finetuning in resource-constrained environments.
☆50Updated last year
Alternatives and similar repositories for vectorlm
Users that are interested in vectorlm are comparing it to the libraries listed below
Sorting:
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆158Updated last month
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆83Updated 2 months ago
- A fast, effective data attribution method for neural networks in PyTorch☆215Updated 8 months ago
- AI Logging for Interpretability and Explainability🔬☆125Updated last year
- nanoGPT-like codebase for LLM training☆102Updated 2 months ago
- Efficient LLM inference on Slurm clusters using vLLM.☆69Updated last week
- ☆235Updated last year
- ☆31Updated last year
- ☆103Updated 6 months ago
- ☆71Updated 3 years ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆41Updated 6 months ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆108Updated last year
- ☆60Updated 3 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆74Updated 5 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆202Updated 10 months ago
- ☆184Updated last year
- ☆96Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆103Updated 2 years ago
- PyTorch library for Active Fine-Tuning☆88Updated 5 months ago
- ☆83Updated last year
- ☆50Updated last year
- ☆125Updated last year
- ☆43Updated last year
- Building modular LMs with parameter-efficient fine-tuning.☆112Updated this week
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆73Updated 10 months ago
- ☆34Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆112Updated last month
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆94Updated 3 years ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆78Updated 7 months ago
- Sparse and discrete interpretability tool for neural networks☆63Updated last year