[ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"
☆18Jul 31, 2025Updated 7 months ago
Alternatives and similar repositories for Pseudo-RIS
Users that are interested in Pseudo-RIS are comparing it to the libraries listed below
Sorting:
- Related papers about Referring Image Segmentation (RIS)☆16Dec 26, 2023Updated 2 years ago
- ☆12May 26, 2023Updated 2 years ago
- Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)"☆14Dec 13, 2024Updated last year
- Code release for "Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation"☆14Oct 23, 2023Updated 2 years ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆13Sep 18, 2025Updated 5 months ago
- Official PyTorch implementation of CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences (CVPR 2024 Po…☆19Apr 29, 2024Updated last year
- ☆18Jun 10, 2023Updated 2 years ago
- Official PyTorch Implementation of Efficient and Versatile Robust Fine-Tuning of Zero-shot Models, ECCV 2024☆17Oct 3, 2024Updated last year
- [ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"☆73Oct 13, 2024Updated last year
- [ICCV 2023] The official PyTorch implementation of the paper: "Localizing Moments in Long Video Via Multimodal Guidance"☆19Sep 26, 2024Updated last year
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 6 months ago
- Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes".☆26Jan 11, 2024Updated 2 years ago
- ☆45Oct 3, 2023Updated 2 years ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos☆19Mar 3, 2025Updated 11 months ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Mar 11, 2023Updated 2 years ago
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 3 months ago
- Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021☆58Mar 31, 2022Updated 3 years ago
- (TMI-2024) Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery☆25Nov 13, 2024Updated last year
- ☆23Aug 20, 2024Updated last year
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆58Dec 22, 2025Updated 2 months ago
- Official repository for CATs++: Boosting Cost Aggregation with Convolutions and Transformers (TPAMI'22)☆49Jan 10, 2024Updated 2 years ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆28Nov 28, 2024Updated last year
- ☆28Jul 22, 2024Updated last year
- [ICLR 2024] The official implementation of Zip-Your-Clip☆35Mar 14, 2024Updated last year
- An unofficial implementation for paper "DenseCLIP: Extract Free Dense Labels from CLIP"☆23Jan 27, 2022Updated 4 years ago
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆30Apr 4, 2022Updated 3 years ago
- ☆29Jun 10, 2024Updated last year
- An Open-access Dataset for Liver Lesion Diagnosis on Multi-phase MRI☆36Apr 7, 2025Updated 10 months ago
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding☆31Aug 5, 2023Updated 2 years ago
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆29Sep 26, 2024Updated last year
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆33Oct 12, 2024Updated last year
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated last year
- RefVOS☆29Feb 3, 2021Updated 5 years ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Sep 17, 2022Updated 3 years ago
- ☆31Jun 14, 2024Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆17Dec 25, 2025Updated 2 months ago
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation,☆49Mar 20, 2025Updated 11 months ago
- ☆76Sep 30, 2022Updated 3 years ago