yz93 / LAVT-RIS
☆198 · Updated 2 years ago
Alternatives and similar repositories for LAVT-RIS:
Users who are interested in LAVT-RIS are comparing it to the libraries listed below:
- Open-vocabulary Semantic Segmentation ☆171 · Updated 2 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding ☆134 · Updated 5 months ago
- An official PyTorch implementation of the CRIS paper ☆270 · Updated 10 months ago
- Official implementation of CVPR 2023 ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation ☆239 · Updated last year
- A new framework for open-vocabulary object detection, based on maskrcnn-benchmark ☆237 · Updated 2 years ago
- Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral) ☆438 · Updated 2 years ago
- ☆180 · Updated 2 years ago
- [CVPR 2023] CLIP is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation ☆191 · Updated 7 months ago
- ☆143 · Updated last year
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding. ☆119 · Updated 2 months ago
- Referring Video Object Segmentation / Multi-Object Tracking Repo ☆87 · Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆183 · Updated last year
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection ☆181 · Updated last year
- Official code for "Decoupling Zero-Shot Semantic Segmentation" ☆171 · Updated 2 years ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati… ☆65 · Updated 10 months ago
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation" ☆90 · Updated last year
- Open-vocabulary Semantic Segmentation ☆341 · Updated 6 months ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022 ☆95 · Updated 2 years ago
- ☆180 · Updated last year
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild" ☆110 · Updated 4 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding", NeurIPS 2021 ☆66 · Updated 2 years ago
- [CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features" ☆119 · Updated 3 weeks ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022) ☆226 · Updated 2 years ago
- [ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement" ☆144 · Updated 11 months ago
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding ☆148 · Updated 9 months ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843) ☆186 · Updated last year
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting ☆532 · Updated last year
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023 ☆190 · Updated 2 years ago
- A lightweight codebase for referring expression comprehension and segmentation ☆54 · Updated 2 years ago
- Self-Supervised Video Representation Learning with Motion-Aware Masked Autoencoders ☆23 · Updated 8 months ago