zhouyiks / CoLVALinks
β39Updated 2 months ago
Alternatives and similar repositories for CoLVA
Users that are interested in CoLVA are comparing it to the libraries listed below
Sorting:
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"β108Updated 3 months ago
- π₯ [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"β43Updated last year
- β24Updated 5 months ago
- [ICLR'25] Reconstructive Visual Instruction Tuningβ118Updated 5 months ago
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objectsβ51Updated last year
- β44Updated last year
- [CVPR 2025 π₯]A Large Multimodal Model for Pixel-Level Visual Grounding in Videosβ86Updated 5 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioningβ79Updated 11 months ago
- Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal Promptingβ53Updated 2 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos