bloomberg/dataless-model-merging

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bloomberg/dataless-model-merging)

bloomberg / dataless-model-merging

Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)

☆92

Alternatives and similar repositories for dataless-model-merging

Users that are interested in dataless-model-merging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mmatena / model_merging
View on GitHub
☆81Mar 17, 2022Updated 4 years ago
prateeky2806 / ties-merging
View on GitHub
☆216Feb 3, 2024Updated 2 years ago
EnnengYang / RepresentationSurgery
View on GitHub
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆49Oct 10, 2024Updated last year
Raincleared-Song / ConPET
View on GitHub
Source code for a LoRA-based continual relation extraction method.
☆14Sep 25, 2023Updated 2 years ago
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆870May 5, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ylsung / vl-merging
View on GitHub
PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"
☆37Oct 11, 2023Updated 2 years ago
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
hkust-nlp / PEM_composition
View on GitHub
[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
☆61Nov 26, 2023Updated 2 years ago
nverma1 / merging-text-transformers
View on GitHub
Code for "Merging Text Transformers from Different Initializations"
☆20Feb 2, 2025Updated last year
mlfoundations / task_vectors
View on GitHub
Editing Models with Task Arithmetic
☆548Jan 11, 2024Updated 2 years ago
uiuctml / GOAT
View on GitHub
[JMLR] Gradual Domain Adaptation: Theory and Algorithms
☆11Jan 14, 2025Updated last year
EnnengYang / AdaMerging
View on GitHub
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆113Oct 28, 2024Updated last year
text-machine-lab / adversarial_decomposition
View on GitHub
The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019
☆29Dec 8, 2022Updated 3 years ago
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
View on GitHub
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.
☆769Jul 17, 2026Updated last week
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
uiuctml / Localize-and-Stitch
View on GitHub
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆32Feb 18, 2026Updated 5 months ago
zju-vipa / training_free_model_merging
View on GitHub
This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).
☆34Mar 5, 2024Updated 2 years ago
IBM / NeuronAlignment
View on GitHub
Codes for the paper "Optimizing Mode Connectivity via Neuron Alignment" from NeurIPS 2020.
☆16Dec 10, 2020Updated 5 years ago
yule-BUAA / MergeLLM
View on GitHub
Codes for Merging Large Language Models
☆37Aug 7, 2024Updated last year
alon-albalak / online-data-mixing
View on GitHub
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14Jan 9, 2024Updated 2 years ago
ychen-stat-ml / kernel-adapters
View on GitHub
Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…
☆11Feb 6, 2023Updated 3 years ago
SALT-NLP / Adaptive-Compositional-Modules
View on GitHub
Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"
☆39Apr 4, 2022Updated 4 years ago
cambridgeltl / autopeft
View on GitHub
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL 2024)
☆51Mar 17, 2024Updated 2 years ago
gortizji / tangent_task_arithmetic
View on GitHub
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆114Jun 8, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yiren-jian / NonLing-CSE
View on GitHub
[NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings
☆22Jan 30, 2023Updated 3 years ago
qiuzh20 / EMoE
View on GitHub
Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]
☆39May 28, 2024Updated 2 years ago
thunlp / CSS-LM
View on GitHub
CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models
☆11Jul 1, 2023Updated 3 years ago
latynt / ans
View on GitHub
Arabic News Stance Corpus
☆11Feb 5, 2021Updated 5 years ago
gstoica27 / ZipIt
View on GitHub
A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…
☆316Jan 18, 2024Updated 2 years ago
ShaojieJiang / CT-Loss
View on GitHub
The contrastive token loss function for reducing generative repetition of autoregressive neural language models.
☆13May 11, 2022Updated 4 years ago
thunlp / Prompt-Transferability
View on GitHub
On Transferability of Prompt Tuning for Natural Language Processing
☆98May 3, 2024Updated 2 years ago
nik-dim / tall_masks
View on GitHub
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆53Dec 22, 2025Updated 7 months ago
uiuctml / MergeBench
View on GitHub
[NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs
☆47Feb 11, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
wjko2 / Linguistically-Informed-Specificity-and-Semantic-Plausibility-for-Dialogue-Generation
View on GitHub
☆10Jun 11, 2019Updated 7 years ago
mlfoundations / model-soups
View on GitHub
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
☆517Jul 15, 2024Updated 2 years ago
google / t5patches
View on GitHub
T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.
☆12May 31, 2024Updated 2 years ago
kernelmachine / demix-data
View on GitHub
Benchmark API for Multidomain Language Modeling
☆25Aug 26, 2022Updated 3 years ago
snimu / rebasin
View on GitHub
Apply methods described in "Git Re-basin"-paper [1] to arbitrary models --- [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836)
☆16Updated this week
tanganke / fusion_bench
View on GitHub
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆235Jun 23, 2026Updated last month
adamxyang / laplace-lora
View on GitHub
Bayesian low-rank adaptation for large language models
☆29May 4, 2024Updated 2 years ago