zhaohengyuan1 / GenixerLinks
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
β112Updated 4 months ago
Alternatives and similar repositories for Genixer
Users that are interested in Genixer are comparing it to the libraries listed below
Sorting:
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Modelsβ97Updated last year
- π [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2β¦β84Updated last month
- u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model