EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
☆286Updated this week
Alternatives and similar repositories for Awesome-Model-Merging-Methods-Theories-Applications:
Users that are interested in Awesome-Model-Merging-Methods-Theories-Applications are comparing it to the libraries listed below
- ☆159Updated 11 months ago
- A curated list of Model Merging methods.☆89Updated 4 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆141Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆197Updated 3 months ago
- Continual Learning of Large Language Models: A Comprehensive Survey☆314Updated 2 weeks ago
- ☆121Updated 5 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆382Updated 9 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆61Updated 2 months ago
- A Survey on Data Selection for Language Models☆201Updated 3 months ago
- The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".☆200Updated this week
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆42Updated 2 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆39Updated 2 months ago
- ☆251Updated last year
- LLM-Merging: Building LLMs Efficiently through Merging☆184Updated 3 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆62Updated last month
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆400Updated 2 months ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆69Updated last month
- Code accompanying the paper "Massive Activations in Large Language Models"☆133Updated 10 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆92Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆142Updated last month
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆101Updated this week
- ☆152Updated 6 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆288Updated last year
- RewardBench: the first evaluation tool for reward models.☆491Updated last week
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆119Updated 4 months ago
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning☆180Updated 8 months ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆452Updated 8 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆142Updated 5 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆86Updated last week
- Awesome-Low-Rank-Adaptation☆61Updated 3 months ago