mbzuai-oryx / VideoGLaMM

[CVPR 2025 πŸ”₯]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
β˜†52Updated this week

Alternatives and similar repositories for VideoGLaMM:

Users that are interested in VideoGLaMM are comparing it to the libraries listed below