☆80Mar 17, 2022Updated 3 years ago
Alternatives and similar repositories for model_merging
Users that are interested in model_merging are comparing it to the libraries listed below
Sorting:
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆92Jul 25, 2023Updated 2 years ago
- ☆210Feb 3, 2024Updated 2 years ago
- Editing Models with Task Arithmetic☆535Jan 11, 2024Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆99Oct 28, 2024Updated last year
- ☆11Jul 21, 2024Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆112Jun 8, 2023Updated 2 years ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆680Updated this week
- Codes for Merging Large Language Models☆35Aug 7, 2024Updated last year
- [ACL 2023] Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction?☆10Dec 15, 2025Updated 2 months ago
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things.☆13Jun 13, 2021Updated 4 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- ☆33Jul 8, 2024Updated last year
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆24Sep 13, 2024Updated last year
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆43Feb 11, 2026Updated 3 weeks ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain☆11Jul 14, 2020Updated 5 years ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆30Jun 7, 2024Updated last year
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- ☆52Jan 1, 2024Updated 2 years ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆29May 15, 2024Updated last year
- ☆11Jun 23, 2022Updated 3 years ago
- Patching open-vocabulary models by interpolating weights☆91Sep 28, 2023Updated 2 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆77Mar 1, 2025Updated last year
- Codebase for Merging Language Models (ICML 2024)☆863May 5, 2024Updated last year
- Pytorch optimizers implementing Hilbert Constrained Gradient Descent☆19May 9, 2019Updated 6 years ago
- ☆17Jun 20, 2024Updated last year
- Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022☆14Nov 7, 2022Updated 3 years ago
- ☆58Jan 28, 2026Updated last month
- ☆18Mar 10, 2023Updated 2 years ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Nov 26, 2023Updated 2 years ago
- A curated list of Model Merging methods.☆95Dec 3, 2025Updated 3 months ago
- Learning Representations that Support Robust Transfer of Predictors☆20Nov 7, 2021Updated 4 years ago
- ☆15Mar 22, 2023Updated 2 years ago
- ☆18Aug 19, 2024Updated last year
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning☆14Jun 3, 2025Updated 9 months ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆205Feb 6, 2026Updated 3 weeks ago
- ☆77Apr 29, 2024Updated last year
- ☆52Jan 19, 2023Updated 3 years ago