yule-BUAA/MergeLLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yule-BUAA/MergeLLM)

yule-BUAA / MergeLLM

Codes for Merging Large Language Models

☆37

Alternatives and similar repositories for MergeLLM

Users that are interested in MergeLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

declare-lab / della
View on GitHub
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
☆37Jul 12, 2024Updated 2 years ago
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆13Feb 11, 2026Updated 5 months ago
duguodong7 / pcb-merging
View on GitHub
[NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging
☆48Oct 11, 2024Updated last year
jinlanfu / Polyglot_Prompt
View on GitHub
Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.
☆18Dec 7, 2022Updated 3 years ago
uiuctml / Localize-and-Stitch
View on GitHub
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆32Feb 18, 2026Updated 5 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
tanganke / weight-ensembling_MoE
View on GitHub
Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"
☆32Jun 7, 2024Updated 2 years ago
Zhengsh123 / FREE-Merging
View on GitHub
The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)
☆16Jun 26, 2025Updated last year
MANGA-UOFA / PTfer
View on GitHub
☆11Nov 13, 2024Updated last year
EnnengYang / RepresentationSurgery
View on GitHub
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆49Oct 10, 2024Updated last year
harveyhuang18 / EMR_Merging
View on GitHub
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆83Mar 1, 2025Updated last year
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆869May 5, 2024Updated 2 years ago
hkust-nlp / PEM_composition
View on GitHub
[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
☆61Nov 26, 2023Updated 2 years ago
martyn / safetensors-merge-supermario
View on GitHub
Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a…
☆83Oct 17, 2024Updated last year
tanganke / fusion_bench
View on GitHub
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆235Jun 23, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ArminAzizi98 / LaMDA
View on GitHub
☆15Nov 7, 2024Updated last year
Miaow-Lab / RLVR-Linearity
View on GitHub
[arXiv] "Linear Dynamics in the RLVR Training of Large Language Models"
☆17May 25, 2026Updated last month
tianyi-lab / RuleR
View on GitHub
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14Sep 27, 2025Updated 9 months ago
LHL3341 / MetaLadder
View on GitHub
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)
☆12Apr 18, 2025Updated last year
xydaytoy / EVA
View on GitHub
☆14Apr 16, 2024Updated 2 years ago
CHEN-YIZHU / GACL
View on GitHub
[NeurIPS 2024] GACL: Exemplar-Free Generalized Analytic Continual Learning
☆18Nov 5, 2024Updated last year
EnnengYang / AdaMerging
View on GitHub
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆113Oct 28, 2024Updated last year
snudm-starlab / K-prune
View on GitHub
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)
☆14May 31, 2025Updated last year
prateeky2806 / ties-merging
View on GitHub
☆216Feb 3, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
View on GitHub
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.
☆769Updated this week
nik-dim / tall_masks
View on GitHub
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆53Dec 22, 2025Updated 7 months ago
zzhang0179 / Unveiling-Linguistic-Regions-in-LLMs
View on GitHub
[ACL 2024] Unveiling Linguistic Regions in Large Language Models
☆34Jun 9, 2024Updated 2 years ago
tanganke / peta
View on GitHub
Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
☆26Sep 13, 2024Updated last year
deepghs / sdeval
View on GitHub
Evaluation for stable diffusion model training
☆27Aug 24, 2024Updated last year
yifanycc / loretta
View on GitHub
[NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
☆39Jan 9, 2025Updated last year
AntoAndGar / task_singular_vectors
View on GitHub
Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.
☆57Dec 15, 2025Updated 7 months ago
Cohere-Labs-Community / iterative-data-selection
View on GitHub
☆30Nov 5, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
VITA-Group / instant_soup
View on GitHub
[ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…
☆11Nov 28, 2023Updated 2 years ago
DYR1 / MoGU
View on GitHub
Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.
☆18Jan 14, 2025Updated last year
hkgc-1 / GHPO
View on GitHub
☆62Jul 21, 2025Updated last year
YuxiaoWang-AI / PIHOT
View on GitHub
☆12Dec 19, 2024Updated last year
peijunallin / alphalora
View on GitHub
☆19Nov 10, 2024Updated last year
fbarez / neuroplasticity
View on GitHub
☆14Mar 31, 2024Updated 2 years ago
opendatalab / MLLM-DataEngine
View on GitHub
MLLM-DataEngine: An Iterative Refinement Approach for MLLM
☆49May 24, 2024Updated 2 years ago