A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training
☆315Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for ZipIt
Users that are interested in ZipIt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Merging Text Transformers from Different Initializations"☆20Feb 2, 2025Updated last year
- A curated list of Model Merging methods.☆95Dec 3, 2025Updated 6 months ago
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆92Apr 3, 2025Updated last year
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆53Jan 29, 2024Updated 2 years ago
- Git Re-Basin: Merging Models modulo Permutation Symmetries in PyTorch☆78Feb 9, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"☆513Mar 7, 2023Updated 3 years ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆34Mar 5, 2024Updated 2 years ago
- Codebase for Merging Language Models (ICML 2024)☆868May 5, 2024Updated 2 years ago
- Editing Models with Task Arithmetic☆545Jan 11, 2024Updated 2 years ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.☆31Apr 19, 2024Updated 2 years ago
- Codes for the paper "Optimizing Mode Connectivity via Neuron Alignment" from NeurIPS 2020.☆16Dec 10, 2020Updated 5 years ago
- ☆19Feb 15, 2023Updated 3 years ago
- Official PyTorch Implementation of "Learning to Learn with Generative Models of Neural Network Checkpoints"☆346Oct 3, 2022Updated 3 years ago
- Codes for Merging Large Language Models☆37Aug 7, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- an official PyTorch implementation of the paper "Partial Network Cloning", CVPR 2023☆13Mar 21, 2023Updated 3 years ago
- Model Fusion via Optimal Transport, NeurIPS 2020☆155Nov 16, 2022Updated 3 years ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆32Jun 7, 2024Updated 2 years ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 11 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆144Mar 17, 2025Updated last year
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆92Jul 25, 2023Updated 2 years ago
- Code for ECCV 2022 paper “Learning with Recoverable Forgetting”☆21Jul 27, 2022Updated 3 years ago
- ☆77Apr 29, 2024Updated 2 years ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition☆671Jul 22, 2024Updated last year
- Package to align tokens from different tokenizations.☆16Mar 25, 2024Updated 2 years ago
- Tools for merging pretrained large language models.☆7,154Updated this week
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated 2 years ago
- Proportional Amplitude Spectrum Training Augmentation for Synthetic to Real Domain Generalization☆23Mar 11, 2024Updated 2 years ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆231Jun 11, 2026Updated last week
- Personal implementation of ASIF by Antonio Norelli☆26May 24, 2024Updated 2 years ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆114Jun 8, 2023Updated 3 years ago
- A benchmark suite for Scalable Diverse Model Selection for Accessible Transfer Learning from our NeurIPS 2021 paper.☆15Dec 14, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 4 months ago
- Apply methods described in "Git Re-basin"-paper [1] to arbitrary models --- [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836)☆15Jun 8, 2026Updated last week
- Code repository for the c-BTM paper☆109Sep 26, 2023Updated 2 years ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,425Nov 29, 2024Updated last year
- ☆12Oct 2, 2023Updated 2 years ago
- Token-level adaptation of LoRA matrices for downstream task generalization.☆15Apr 14, 2024Updated 2 years ago
- [ECCV2022] Factorizing Knowledge in Neural Networks☆91Sep 12, 2022Updated 3 years ago