EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
☆316Updated this week
Alternatives and similar repositories for Awesome-Model-Merging-Methods-Theories-Applications:
Users who are interested in Awesome-Model-Merging-Methods-Theories-Applications are comparing it to the repositories listed below
- Continual Learning of Large Language Models: A Comprehensive Survey☆344Updated 2 weeks ago
- A curated list of Model Merging methods.☆89Updated 5 months ago
- ☆166Updated last year
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆398Updated 10 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆141Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc.☆201Updated 4 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆65Updated 3 months ago
- ☆123Updated 6 months ago
- The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".☆241Updated last month
- LLM hallucination paper list☆303Updated 11 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆161Updated 2 months ago
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight)☆324Updated 2 weeks ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆464Updated last month
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning / Fine-Tuning☆185Updated 9 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆291Updated last year
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆412Updated 4 months ago
- A Survey on Data Selection for Language Models☆210Updated 4 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆190Updated 4 months ago
- ☆163Updated 7 months ago
- ☆251Updated last year
- Code accompanying the paper "Massive Activations in Large Language Models"☆140Updated 11 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆130Updated 5 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆95Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆114Updated 7 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆138Updated 4 months ago
- RewardBench: the first evaluation tool for reward models.☆508Updated this week
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆160Updated 2 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆44Updated 3 months ago
- This repository collects awesome surveys, resources, and papers on Lifelong Learning for Large Language Models. (Updated Regularly)☆39Updated 2 weeks ago
- Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"☆173Updated 6 months ago