RainBowLuoCS / DEEM
(ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆26Updated 2 weeks ago
Alternatives and similar repositories for DEEM:
Users that are interested in DEEM are comparing it to the libraries listed below
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆40Updated 3 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆44Updated 4 months ago
- PyTorch implementation of StableMask (ICML'24)☆12Updated 8 months ago
- ☆31Updated 8 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆54Updated 6 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.