lhc1224 / OSAD_Net
PyTorch implementation of One-Shot Affordance Detection
☆59 · Updated 2 weeks ago
Related projects:
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022 ☆45 · Updated 3 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation ☆70 · Updated 2 months ago
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22 ☆42 · Updated last year
- PyTorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut" ☆56 · Updated last year
- A Model for Embodied Adaptive Object Detection ☆42 · Updated 2 years ago
- ☆54 · Updated 2 years ago
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation" ☆78 · Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition". ☆32 · Updated last year
- ☆77 · Updated 2 years ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023) ☆30 · Updated last year
- An Examination of the Compositionality of Large Generative Vision-Language Models ☆20 · Updated 5 months ago
- [CVPR 2022] Amodal Segmentation through Out-of-Task and Out-of-Distribution Generalization with a Bayesian Model ☆22 · Updated last month
- ICCV'2023 | CTVIS: Consistent Training for Online Video Instance Segmentation ☆70 · Updated 11 months ago
- Large-Vocabulary Video Instance Segmentation dataset ☆73 · Updated 2 months ago
- ☆8 · Updated 10 months ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning ☆64 · Updated last year
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022 ☆86 · Updated 8 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023] ☆23 · Updated 6 months ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping ☆95 · Updated last year
- [AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation ☆62 · Updated 2 months ago
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023. ☆48 · Updated last year
- Code Release for MaskCLIP (ICML 2023) ☆48 · Updated 9 months ago
- [ECCV2022] A PyTorch implementation of the paper "Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embo… ☆13 · Updated last year
- ☆32 · Updated 5 months ago
- Detectron2 Toolbox and Benchmark for V3Det ☆15 · Updated 3 months ago
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets ☆34 · Updated last month
- ☆57 · Updated last year
- Referring Video Object Segmentation / Multi-Object Tracking Repo ☆84 · Updated last year
- Open-vocabulary Semantic Segmentation ☆162 · Updated last year
- ☆57 · Updated last year