Codes for Merging Large Language Models
☆35Aug 7, 2024Updated last year
Alternatives and similar repositories for MergeLLM
Users that are interested in MergeLLM are comparing it to the libraries listed below
Sorting:
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- ☆12Feb 11, 2026Updated 2 weeks ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 8 months ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated last week
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆77Mar 1, 2025Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆205Feb 6, 2026Updated 3 weeks ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆140Mar 17, 2025Updated 11 months ago
- ☆11Nov 13, 2024Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆47Oct 10, 2024Updated last year
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 5 months ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆13Sep 2, 2024Updated last year
- ☆30Nov 5, 2024Updated last year
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆680Updated this week
- ☆15Nov 7, 2024Updated last year
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆33Mar 5, 2024Updated last year
- ☆14Apr 16, 2024Updated last year
- ☆14Mar 31, 2024Updated last year
- super-resolution; post-training quantization; model compression☆14Nov 10, 2023Updated 2 years ago
- FreeEnricher: Enriching Face Landmarks without Additional Cost [Official, AAAI 2023]☆18Dec 2, 2024Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 9 months ago
- ☆18Nov 10, 2024Updated last year
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Nov 26, 2023Updated 2 years ago
- ☆210Feb 3, 2024Updated 2 years ago
- ☆18Aug 19, 2024Updated last year
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆52Jan 18, 2026Updated last month
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- Codebase for Merging Language Models (ICML 2024)☆863May 5, 2024Updated last year
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Jun 26, 2025Updated 8 months ago
- Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)☆14May 31, 2025Updated 9 months ago
- ☆22Mar 2, 2025Updated last year
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- Code for "Reasoning to Learn from Latent Thoughts"☆124Mar 28, 2025Updated 11 months ago
- Unofficial Implementation of Evolutionary Model Merging☆41Mar 28, 2024Updated last year
- Model merging is a highly efficient approach for long-to-short reasoning.☆100Oct 15, 2025Updated 4 months ago
- ☆25Nov 17, 2025Updated 3 months ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Sep 27, 2025Updated 5 months ago