nathanielyvo / WUDI-MergingLinks
The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""
☆40Updated 2 months ago
Alternatives and similar repositories for WUDI-Merging
Users that are interested in WUDI-Merging are comparing it to the libraries listed below
Sorting:
- ☆55Updated last year
- ☆61Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆235Updated last year
- Code release for VTW (AAAI 2025 Oral)☆65Updated last month
- ☆111Updated 3 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆90Updated last year
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆117Updated 6 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆66Updated 3 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆196Updated last year
- One-shot Entropy Minimization☆187Updated 6 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆171Updated 2 months ago
- ☆292Updated 5 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆235Updated last week
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆104Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆163Updated 5 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆88Updated 10 months ago
- ☆169Updated last year
- 关于LLM和Multimodal LLM的paper list☆50Updated 2 weeks ago
- ☆57Updated 5 months ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆361Updated last year
- [TMLR 2025] Efficient Reasoning Models: A Survey☆285Updated last month
- ☆28Updated last year
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆196Updated 3 weeks ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆389Updated last year
- Latest Advances on Modality Priors in Multimodal Large Language Models☆29Updated last week
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆46Updated last year
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆380Updated 2 months ago
- ☆55Updated 6 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆98Updated last year
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆59Updated 8 months ago