THUNLP-MT / ModelComposeLinks
Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)
☆30Updated 8 months ago
Alternatives and similar repositories for ModelCompose
Users that are interested in ModelCompose are comparing it to the libraries listed below
Sorting:
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆150Updated last year
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆97Updated last year
- 😎 curated list of awesome LMM hallucinations papers, methods & resources.☆149Updated last year
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆223Updated last month
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual…☆82Updated 7 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆98Updated 9 months ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆50Updated 2 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆55Updated 10 months ago
- ☆82Updated last year
- ☆100Updated last year
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization☆92Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆81Updated 10 months ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆48Updated last year
- ☆140Updated 7 months ago
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆93Updated last month
- ☆63Updated last year
- Official repository for the A-OKVQA dataset☆99Updated last year
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆43Updated 2 months ago
- ☆55Updated last month
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆80Updated last year
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆298Updated 10 months ago
- HallE-Control: Controlling Object Hallucination in LMMs☆31Updated last year
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆319Updated 11 months ago
- my commonly-used tools☆61Updated 8 months ago
- ☆79Updated 6 years ago
- Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language models☆70Updated 2 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆59Updated last year
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆62Updated 3 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆109Updated 2 weeks ago
- PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Larg…☆29Updated 4 months ago