mbzuai-oryx / VideoGLaMM

A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
34Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for VideoGLaMM