nathanielyvo / WUDI-MergingLinks
The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""
☆37Updated last month
Alternatives and similar repositories for WUDI-Merging
Users that are interested in WUDI-Merging are comparing it to the libraries listed below
Sorting:
- ☆109Updated 2 months ago
- ☆56Updated 4 months ago
- ☆53Updated last year
- Code release for VTW (AAAI 2025 Oral)☆64Updated 3 weeks ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆61Updated 4 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆87Updated 9 months ago
- ☆168Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆233Updated 11 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆194Updated 2 weeks ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆86Updated 11 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆46Updated last year
- ☆56Updated 5 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆59Updated 7 months ago
- ☆184Updated 6 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆72Updated 8 months ago
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆104Updated last year
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆197Updated last year
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆386Updated last year
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆62Updated 2 months ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)☆53Updated 10 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆164Updated last month
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆36Updated 10 months ago
- ☆289Updated 4 months ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆189Updated 5 months ago
- 📜 Paper list on decoding methods for LLMs and LVLMs☆65Updated 3 weeks ago
- [ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models☆32Updated last year
- [TMLR 2025] Efficient Reasoning Models: A Survey☆280Updated last month
- ☆59Updated 11 months ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆345Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆161Updated 5 months ago