mmatena / model_merging
☆66Updated 3 years ago
Alternatives and similar repositories for model_merging:
Users that are interested in model_merging are comparing it to the libraries listed below
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆101Updated last year
- ☆30Updated 9 months ago
- ☆93Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆44Updated 6 months ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆75Updated 4 months ago
- Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models☆142Updated 2 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆80Updated last year
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆89Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆64Updated 7 months ago
- ☆29Updated last month
- AI Logging for Interpretability and Explainability🔬☆115Updated 10 months ago
- ☆176Updated last year
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆27Updated 11 months ago
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆17Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆76Updated last year
- ☆29Updated last year
- ☆92Updated 2 months ago
- ☆49Updated last year
- ☆62Updated last year
- ☆40Updated last year
- Learning adapter weights from task descriptions☆17Updated last year
- LoFiT: Localized Fine-tuning on LLM Representations☆37Updated 3 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆72Updated last month
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆78Updated 6 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆91Updated 3 years ago
- ☆35Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆46Updated last year
- ☆11Updated 2 years ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆24Updated 3 months ago