JierunChen / Ref-L4Links
Evaluation code for Ref-L4, a new REC benchmark in the LMM era
☆53Updated last year
Alternatives and similar repositories for Ref-L4
Users that are interested in Ref-L4 are comparing it to the libraries listed below
Sorting:
- [ICLR2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want☆93Updated last month
- Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"☆41Updated 9 months ago
- ☆124Updated last year
- Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs☆97Updated last year
- [NeurIPS 2024] Visual Perception by Large Language Model’s Weights☆55Updated 9 months ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆158Updated last year
- [ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM☆86Updated last year
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models☆77Updated last year
- The official implementation of RAR☆92Updated last month
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency