mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
☆782 Updated 5 months ago

Related projects

Alternatives and complementary repositories for groundingLMM