EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
☆255Updated this week
Alternatives and similar repositories for Awesome-Model-Merging-Methods-Theories-Applications:
Users that are interested in Awesome-Model-Merging-Methods-Theories-Applications are comparing it to the libraries listed below
- ☆154Updated 10 months ago
- A curated list of Model Merging methods.☆85Updated 3 months ago
- The official GitHub page for the survey paper "A Survey on Mixture of Experts".☆162Updated last week
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆56Updated last month
- Continual Learning of Large Language Models: A Comprehensive Survey☆286Updated last week
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆185Updated 2 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆138Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆89Updated last year
- ☆118Updated 4 months ago
- This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)☆43Updated last month
- Code accompanying the paper "Massive Activations in Large Language Models"☆130Updated 9 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆35Updated last month
- ☆250Updated last year
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning☆178Updated 7 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆58Updated 3 weeks ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆167Updated 4 months ago
- A Survey on Data Selection for Language Models☆193Updated 2 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆114Updated 2 weeks ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆78Updated last week
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆36Updated last month
- AnchorAttention: Improved attention for LLMs long-context training☆189Updated last week
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆66Updated 2 weeks ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆352Updated 7 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆179Updated 2 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆43Updated 2 weeks ago
- ☆10Updated 8 months ago
- A curated list of awesome Multimodal studies.☆106Updated last month
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆281Updated last year
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆27Updated 5 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆108Updated 3 weeks ago