The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
☆23Aug 17, 2025Updated 6 months ago
Alternatives and similar repositories for CoHD
Users that are interested in CoHD are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆33Mar 16, 2024Updated last year
- [ICCV 2023 Workshop] The Official Implementation of The First Prize Solution for RVOS Competition☆14Jan 1, 2024Updated 2 years ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆12Oct 17, 2023Updated 2 years ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆20Sep 5, 2025Updated 6 months ago
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated 2 years ago
- ☆66Feb 23, 2026Updated last week
- [CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection☆114Jul 17, 2024Updated last year
- [ACMMM 2024] Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors☆25Oct 22, 2024Updated last year
- ☆22May 9, 2024Updated last year
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆284Mar 8, 2024Updated last year
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆58Dec 22, 2025Updated 2 months ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆28Nov 28, 2024Updated last year
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆35Nov 2, 2024Updated last year
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆35Jun 16, 2025Updated 8 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆76Sep 23, 2024Updated last year
- [ICML2024]The official implementation of SemiRES in PyTorch.☆33Jun 20, 2024Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆53Oct 12, 2025Updated 4 months ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆77Sep 11, 2024Updated last year
- ☆37Mar 22, 2024Updated last year
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- ☆11Jan 18, 2025Updated last year
- MILO perceptual quality metric☆22Dec 8, 2025Updated 2 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- ☆10Apr 7, 2025Updated 10 months ago
- Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"☆66Sep 15, 2025Updated 5 months ago
- [ICLR'26] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs☆97Jan 26, 2026Updated last month
- [NeurIPS 2025 Spotlight] VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.☆157Oct 15, 2025Updated 4 months ago
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"☆20Feb 20, 2026Updated 2 weeks ago
- Progressive Language-guided Visual Learning for Multi-Task Visual Grounding☆13May 9, 2025Updated 9 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆22Nov 1, 2025Updated 4 months ago
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Jan 26, 2024Updated 2 years ago
- ☆13Jan 21, 2025Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- [NeurIPS2022] Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop☆14Apr 13, 2023Updated 2 years ago
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆21Jan 27, 2026Updated last month
- [TCSVT'24] Offical Implementation of 2AFC-LMMs☆12Aug 17, 2024Updated last year