mlfoundations / task_vectorsLinks

Editing Models with Task Arithmetic

☆490

Alternatives and similar repositories for task_vectors

Users that are interested in task_vectors are comparing it to the libraries listed below

Sorting:

gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆103Updated 2 years ago
AlignmentResearch / tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
☆512Updated last year
prateeky2806 / ties-merging
☆185Updated last year
openai / sparse_autoencoder
☆505Updated last year
davidbau / baukit
☆222Updated last year
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆428Updated last year
mmatena / model_merging
☆71Updated 3 years ago
ericwtodd / function_vectors
Function Vectors in Large Language Models (ICLR 2024)
☆175Updated 3 months ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆173Updated last year
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
☆490Updated this week
llm-merging / LLM-Merging
LLM-Merging: Building LLMs Efficiently through Merging
☆202Updated 10 months ago
kmeng01 / rome
Locating and editing factual associations in GPT (NeurIPS 2022)
☆653Updated last year
MadryLab / trak
A fast, effective data attribution method for neural networks in PyTorch
☆214Updated 8 months ago
logix-project / logix
AI Logging for Interpretability and Explainability🔬
☆125Updated last year
likenneth / honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
☆540Updated 6 months ago
jacobdunefsky / transcoder_circuits
☆154Updated 8 months ago
HoagyC / sparse_coding
Using sparse coding to find distributed representations used by neural networks.
☆261Updated last year
collin-burns / discovering_latent_knowledge
☆274Updated last year
mlfoundations / model-soups
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
☆477Updated last year
calpt / awesome-adapter-resources
Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning
☆197Updated last year
Cohere-Labs-Community / parameter-efficient-moe
☆269Updated last year
lucidrains / st-moe-pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
☆352Updated last year
r-three / t-few
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
☆456Updated last year
roeehendel / icl_task_vectors
☆96Updated last year
nrimsky / CAA
Steering Llama 2 with Contrastive Activation Addition
☆167Updated last year
ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models
This repository collects all relevant resources about interpretability in LLMs
☆366Updated 9 months ago
stanfordnlp / pyvene
Stanford NLP Python library for understanding and improving PyTorch models via interventions
☆783Updated last week
dtsip / in-context-learning
☆234Updated last year
KihoPark / linear_rep_geometry
☆103Updated 5 months ago
ArthurConmy / Automatic-Circuit-Discovery
☆234Updated 10 months ago