EnnengYang / Awesome-Model-Merging-Methods-Theories-ApplicationsLinks
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
β506Updated this week
Alternatives and similar repositories for Awesome-Model-Merging-Methods-Theories-Applications
Users that are interested in Awesome-Model-Merging-Methods-Theories-Applications are comparing it to the libraries listed below
Sorting:
- [CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Surveyβ442Updated 3 months ago
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyondβ286Updated last week
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..β265Updated 5 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.β88Updated 9 months ago
- A Survey on Data Selection for Language Modelsβ247Updated 3 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learningβ144Updated last year
- A curated list of Model Merging methods.β92Updated 11 months ago
- [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Modelsβ574Updated this week
- β49Updated 8 months ago
- This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)β59Updated 2 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).β342Updated 2 years ago
- β151Updated last year
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.β153Updated last week
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusionβ160Updated last week
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)β308Updated last month
- π Paper list on decoding methods for LLMs and LVLMsβ55Updated last month
- State-of-the-art Parameter-Efficient MoE Fine-tuning Methodβ178Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuningβ220Updated 8 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"β75Updated 8 months ago
- β261Updated last month
- Paper list for Efficient Reasoning.β608Updated this week
- β161Updated 3 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuningβ477Updated 10 months ago
- [arXiv 2025] Efficient Reasoning Models: A Surveyβ258Updated this week
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)β373Updated last month
- [TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".β407Updated last month
- LLM hallucination paper listβ322Updated last year
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository aggβ¦β119Updated 3 weeks ago
- Papers about training data quality management for ML models.β94Updated last month
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papersβ¦β87Updated 8 months ago