WisconsinAIVision / ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
☆336 · Jul 17, 2024 · Updated last year

Alternatives and similar repositories for ViP-LLaVA

Users who are interested in ViP-LLaVA are comparing it to the libraries listed below.
