tanganke / fusion_bench
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆149 Updated 2 weeks ago
Alternatives and similar repositories for fusion_bench
Users who are interested in fusion_bench are comparing it to the libraries listed below.
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆88 Updated 9 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆62 Updated 5 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666. ☆497 Updated this week
- A curated list of Model Merging methods. ☆92 Updated 10 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024] ☆47 Updated 9 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆73 Updated 7 months ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT ☆108 Updated 5 months ago
- Awesome Low-Rank Adaptation ☆42 Updated this week
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples ☆39 Updated 3 weeks ago
- Accepted LLM Papers in NeurIPS 2024 ☆37 Updated 9 months ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic ☆27 Updated 7 months ago
- ☆149 Updated last year
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization" ☆21 Updated 10 months ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆35 Updated 3 weeks ago
- 📜 Paper list on decoding methods for LLMs and LVLMs ☆55 Updated last month
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method ☆176 Updated 11 months ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts" ☆29 Updated last year
- ☆44 Updated last year
- Papers about training data quality management for ML models. ☆92 Updated last month
- Representation Surgery for Multi-Task Model Merging. ICML, 2024. ☆46 Updated 10 months ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers… ☆85 Updated 8 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper) ☆293 Updated last month
- Codes for Merging Large Language Models ☆33 Updated last year
- ☆49 Updated 3 weeks ago
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models ☆81 Updated last year
- awesome SAE papers ☆41 Updated 2 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆149 Updated this week
- A curated list of resources for activation engineering ☆99 Updated 2 months ago
- ☆156 Updated 2 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models" ☆59 Updated 10 months ago