visual-haystacks / mirage
π₯ [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
β15Updated 3 months ago
Alternatives and similar repositories for mirage
Users that are interested in mirage are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Modelsβ73Updated 11 months ago
- π₯ [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"β27Updated 3 months ago
- Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervisionβ41Updated last month
- [ICML2024] Repo for the paper `Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models'β20Updated 4 months ago
- Compress conventional Vision-Language Pre-training dataβ51Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.β66Updated 11 months ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"β32Updated last year
- A instruction data generation system for multimodal language models.β32Updated 3 months ago
- β41Updated 4 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"β55Updated 8 months ago
- NegCLIP.β31Updated 2 years ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)