SooLab / REP-ERU
[ECCV2022] A PyTorch implementation of the paper "Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding"
☆13Updated last year
Related projects: ⓘ
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆43Updated 2 months ago
- [AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆62Updated 2 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆55Updated 5 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆70Updated 2 months ago
- ☆32Updated 5 months ago
- ☆32Updated last year
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆22Updated 2 years ago
- The offical implemention of JM3D.☆27Updated 11 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆45Updated 4 months ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆63Updated 3 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆49Updated last month
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"☆78Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆62Updated 4 months ago
- Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.☆9Updated 9 months ago
- Large-Vocabulary Video Instance Segmentation dataset☆73Updated 2 months ago
- ☆26Updated last week
- ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆59Updated 7 months ago
- Can 3D Vision-Language Models Truly Understand Natural Language?☆20Updated 5 months ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆22Updated 10 months ago
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆53Updated 10 months ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆47Updated 9 months ago
- ☆16Updated last year
- [ICML2024]The official implementation of SemiRES in PyTorch.☆18Updated 3 months ago
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆84Updated last year
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆15Updated 6 months ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆128Updated 3 months ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆75Updated 6 months ago
- [CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds☆51Updated last year
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. Also, visualization and qb norm search for best performance…☆28Updated 5 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆35Updated last year