ylsung / vl-merging
PyTorch code for the paper "An Empirical Study of Multimodal Model Merging"
☆38 Updated last year
Alternatives and similar repositories for vl-merging:
Users who are interested in vl-merging compare it to the libraries listed below.
- [NeurIPS-2024] Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆77 Updated 4 months ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners" ☆43 Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o… ☆21 Updated 2 months ago
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning ☆30 Updated last year
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners" ☆13 Updated 4 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆32 Updated 9 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆65 Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension ☆39 Updated 2 months ago
- ☆12 Updated last month
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073 ☆26 Updated 7 months ago
- ☆12 Updated 11 months ago
- ☆29 Updated 2 years ago
- [EMNLP-2022 Findings] Code for paper "ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback". ☆26 Updated 2 years ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach" ☆11 Updated 5 months ago
- Self-Supervised Alignment with Mutual Information ☆16 Updated 8 months ago
- ☆27 Updated 3 months ago
- ☆26 Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆46 Updated 2 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆21 Updated 2 months ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models". ☆32 Updated last year
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆49 Updated 4 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆42 Updated 6 months ago
- This repo contains code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" ☆11 Updated last month
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging ☆48 Updated 2 months ago
- [NAACL 2025] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models ☆38 Updated this week
- ☆64 Updated 2 weeks ago
- ☆15 Updated 6 months ago
- ☆80 Updated 11 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆27 Updated 11 months ago
- Preference Learning for LLaVA ☆37 Updated 3 months ago