RainBowLuoCS / DEEMLinks
(ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆37Updated last month
Alternatives and similar repositories for DEEM
Users that are interested in DEEM are comparing it to the libraries listed below
Sorting:
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆57Updated 11 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆34Updated 3 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models