ycjing / Awesome-Model-Merging
A curated list of Model Merging methods.
☆91Updated 7 months ago
Alternatives and similar repositories for Awesome-Model-Merging:
Users that are interested in Awesome-Model-Merging are comparing it to the libraries listed below
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆77Updated 5 months ago
- Awesome-Low-Rank-Adaptation☆92Updated 6 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆56Updated last month
- [EMNLP 2023 Main] Sparse Low-rank Adaptation of Pre-trained Language Models☆74Updated last year
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆21Updated 7 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆42Updated 5 months ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆111Updated last week
- ☆48Updated 4 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆44Updated 6 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆99Updated last year
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆97Updated 9 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆154Updated last year
- ☆99Updated 9 months ago
- Awesome Learn From Model Beyond Fine-Tuning: A Survey☆62Updated 4 months ago
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆30Updated last year
- ☆173Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆136Updated 2 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆367Updated this week
- A block pruning framework for LLMs.☆22Updated 9 months ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆23Updated 10 months ago
- A curated list of resources for activation engineering☆59Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆62Updated 2 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆186Updated 4 months ago
- Official Pytorch Implementation of "OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning" b…☆31Updated 10 months ago
- Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…☆46Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆79Updated 10 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆62Updated 5 months ago
- [ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely☆23Updated 9 months ago
- ☆132Updated 8 months ago
- ☆50Updated last year