zhaohengyuan1 / GenixerLinks
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
β113Updated 5 months ago
Alternatives and similar repositories for Genixer
Users that are interested in Genixer are comparing it to the libraries listed below
Sorting:
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Modelsβ96Updated last year
- π [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2β¦β85Updated 2 months ago
- u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model