mmatena / model_merging

☆77

Alternatives and similar repositories for model_merging

Users who are interested in model_merging are comparing it to the libraries listed below.

gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆105Updated 2 years ago
nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆51Updated last year
roeehendel / icl_task_vectors
☆98Updated last year
r-three / mats
☆31Updated last year
adymaharana / d2pruning
☆41Updated 2 years ago
UKPLab / iclr2024-model-merging
This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.
☆28Updated last year
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆76Updated last year
prateeky2806 / ties-merging
☆194Updated last year
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆184Updated last year
abhishekpanigrahi1996 / Skill-Localization-by-grafting
☆51Updated last year
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆81Updated 10 months ago
logix-project / logix
AI Logging for Interpretability and Explainability🔬
☆129Updated last year
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆92Updated 11 months ago
bloomberg / dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
☆90Updated 2 years ago
socialfoundations / tttlm
Test-time-training on nearest neighbors for large language models
☆46Updated last year
MadryLab / trak
A fast, effective data attribution method for neural networks in PyTorch
☆220Updated 11 months ago
MadryLab / DsDm
☆50Updated last year
zlin7 / UQ-NLG
☆102Updated last year
harveyhuang18 / EMR_Merging
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆69Updated 7 months ago
ajyl / dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆82Updated 7 months ago
tml-epfl / long-is-more-for-alignment
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]
☆19Updated last year
hkust-nlp / PEM_composition
[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
☆61Updated last year
zjysteven / mink-plus-plus
[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs
☆45Updated 5 months ago
BeyonderXX / TRACE
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
☆79Updated last year
tanganke / opcm
official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆20Updated 2 weeks ago
dannyallover / overthinking_the_truth
☆29Updated last year
ericwtodd / function_vectors
Function Vectors in Large Language Models (ICLR 2024)
☆181Updated 6 months ago
deeplearning-wisc / args
☆45Updated last year
TRAIS-Lab / dattri
`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.
☆90Updated last week
milesaturpin / cot-unfaithfulness
☆48Updated 2 years ago