FoundationVision / GromaView on GitHub
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
586Jun 7, 2024Updated last year

Alternatives and similar repositories for Groma

Users that are interested in Groma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?