qumengxue / RIO
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for RIO
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022☆86Updated 10 months ago
- RefVOS☆28Updated 3 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆36Updated 9 months ago
- ☆23Updated last year
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆56Updated last year
- ☆35Updated 2 years ago
- ☆47Updated 2 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated 10 months ago
- Refer-Youtube-VOS dataset☆25Updated 9 months ago
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".☆52Updated 4 months ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆131Updated 3 weeks ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆55Updated this week
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆50Updated last year
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.☆47Updated 3 years ago
- The official implementation of 'Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation' (CVPR 2…☆45Updated 2 years ago
- ☆33Updated last year
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆23Updated 2 years ago
- ☆21Updated 2 years ago
- ☆40Updated last year
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆45Updated 4 months ago
- Salvage of Supervision in Weakly Supervised Object Detection, CVPR 2022☆22Updated 2 years ago
- [AAAI 2022] Pytorch implementation of "LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization".☆22Updated 2 years ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆91Updated last year
- ☆9Updated 9 months ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆32Updated last month
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆20Updated 9 months ago
- [AAAI2023] Repo for the paper ''End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation''.☆22Updated last year
- ☆32Updated 11 months ago
- A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆57Updated last year
- Series of work (ECCV2020, CVPR2021, CVPR2021, ECCV2022) about Compositional Learning for Human-Object Interaction Exploration☆78Updated last year