LiangJian24 / LoRASculptLinks
[CVPR'25 Oral] LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models
☆19Updated last month
Alternatives and similar repositories for LoRASculpt
Users that are interested in LoRASculpt are comparing it to the libraries listed below
Sorting:
- Multimodal Large Language Model (MLLM) Tuning Survey: Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model☆64Updated last week
- [CVPR'25] EMOE: Modality-Specific Enhanced Dynamic Emotion Experts☆32Updated this week
- [ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection☆30Updated 2 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆179Updated 3 weeks ago
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…☆23Updated 4 months ago
- ☆36Updated 2 weeks ago
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"☆101Updated last year
- Instruction Tuning in Continual Learning paradigm☆50Updated 4 months ago
- A curated list of resources dedicated to the safety of Large Vision-Language Models. This repository aligns with our survey titled A Surv…☆109Updated 2 weeks ago
- ☆122Updated 4 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆217Updated 2 months ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆52Updated 7 months ago
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆68Updated 2 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆53Updated 2 months ago
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆108Updated 6 months ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆35Updated 6 months ago
- Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024☆37Updated 6 months ago
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆119Updated last month
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆37Updated 3 months ago
- ☆37Updated 3 months ago
- [CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"☆100Updated 11 months ago
- [CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation☆119Updated 11 months ago
- ☆47Updated 7 months ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆48Updated 2 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆91Updated 7 months ago
- [EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.☆63Updated last week
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding☆73Updated 2 months ago
- ☆24Updated last year
- [ICML 2024] Offical code repo for ICML2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with …☆26Updated last year
- This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehens…☆71Updated 2 months ago