duguodong7 / pcb-mergingLinks
[NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging
☆41Updated 8 months ago
Alternatives and similar repositories for pcb-merging
Users that are interested in pcb-merging are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆59Updated 3 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆45Updated 8 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆21Updated 9 months ago
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆19Updated 4 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆45Updated 8 months ago
- Codes for Merging Large Language Models☆32Updated 10 months ago
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆16Updated last month
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆24Updated last year
- ☆18Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆73Updated 4 months ago
- (ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆34Updated 3 months ago
- EMPO, A Fully Unsupervised RLVR Method☆40Updated 2 weeks ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆18Updated 3 months ago
- Less is More: High-value Data Selection for Visual Instruction Tuning☆14Updated 5 months ago
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"☆19Updated 7 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆41Updated 11 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆35Updated 5 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆83Updated 7 months ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆33Updated 7 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆22Updated 4 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆40Updated last year
- ☆28Updated last year
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆19Updated this week
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆88Updated 8 months ago
- ☆46Updated 2 months ago
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆69Updated 3 weeks ago
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆86Updated 6 months ago
- ☆18Updated 7 months ago
- CLIP-MoE: Mixture of Experts for CLIP☆42Updated 8 months ago
- PyTorch implementation of StableMask (ICML'24)☆13Updated 11 months ago