mlfoundations / task_vectorsLinks
Editing Models with Task Arithmetic
☆508Updated last year
Alternatives and similar repositories for task_vectors
Users that are interested in task_vectors are comparing it to the libraries listed below
Sorting:
- ☆194Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆105Updated 2 years ago
- Tools for understanding how transformer predictions are built layer-by-layer☆535Updated 2 months ago
- ☆234Updated last year
- ☆77Updated 3 years ago
- Function Vectors in Large Language Models (ICLR 2024)☆181Updated 6 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆204Updated last year
- Using sparse coding to find distributed representations used by neural networks.☆280Updated last year
- ☆532Updated last year
- A fast, effective data attribution method for neural networks in PyTorch☆220Updated 11 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆184Updated last year
- ☆240Updated last year
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆439Updated last year
- ☆181Updated 11 months ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).☆315Updated 3 months ago
- Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time☆488Updated last year
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆572Updated last week
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆366Updated last year
- AI Logging for Interpretability and Explainability🔬☆129Updated last year
- ☆107Updated 8 months ago
- ☆98Updated last year
- ☆247Updated last year
- ☆279Updated last year
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆90Updated 2 years ago
- Steering Llama 2 with Contrastive Activation Addition☆191Updated last year
- This repository collects all relevant resources about interpretability in LLMs☆375Updated 11 months ago
- An Extensible Continual Learning Framework Focused on Language Models (LMs)☆288Updated last year
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning☆199Updated last year
- ☆131Updated last week
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…☆309Updated last year