WisconsinAIVision / ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
297Updated 4 months ago

Related projects

Alternatives and complementary repositories for ViP-LLaVA