tanganke / opcmLinks

official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"

☆20

Alternatives and similar repositories for opcm

Users that are interested in opcm are comparing it to the libraries listed below

Sorting:

tanganke / peta
Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
☆22Updated last year
EnnengYang / RepresentationSurgery
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆46Updated last year
nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆51Updated last year
Model-GLUE / Model-GLUE
☆18Updated last year
harveyhuang18 / EMR_Merging
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆70Updated 7 months ago
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆94Updated last year
gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆105Updated 2 years ago
Pbihao / SLM
☆28Updated last year
mmatena / model_merging
☆78Updated 3 years ago
VITA-Group / SEAL
Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free
☆44Updated 6 months ago
uiuctml / Localize-and-Stitch
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆30Updated last month
kyrie-23 / linear_task_arithmetic
☆11Updated 3 months ago
avalonstrel / Mitigating-the-Alignment-Tax-of-RLHF
☆15Updated last year
cliang1453 / task-aware-distillation
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
☆39Updated 2 years ago
VITA-Group / Robust_Weight_Signatures
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16Updated 2 years ago
arumaekawa / DiLM
Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".
☆24Updated 8 months ago
yule-BUAA / MergeLLM
Codes for Merging Large Language Models
☆33Updated last year
abhishekpanigrahi1996 / Skill-Localization-by-grafting
☆51Updated last year
tanganke / weight-ensembling_MoE
Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"
☆29Updated last year
VITA-Group / Junk_DNA_Hypothesis
[ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…
☆16Updated 6 months ago
r-three / mats
☆31Updated last year
tmlr-group / G-effect
[ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"
☆12Updated 8 months ago
TsinghuaC3I / SoRA
[EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models
☆83Updated last year
adymaharana / d2pruning
☆41Updated 2 years ago
which47 / LLMCL
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning
☆36Updated 11 months ago
NUS-TRAIL / Unnatural_Language
The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'
☆23Updated 5 months ago
UKPLab / iclr2024-model-merging
This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.
☆28Updated last year
AntoAndGar / task_singular_vectors
Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.
☆34Updated 2 months ago
licong-lin / negative-preference-optimization
☆66Updated last year
IBM / SafeLoRA
Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"
☆21Updated last month