seanzhuh / SeqTRLinks
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144Updated last year
Alternatives and similar repositories for SeqTR
Users that are interested in SeqTR are comparing it to the libraries listed below
Sorting:
- A lightweight codebase for referring expression comprehension and segmentation☆56Updated 3 years ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆96Updated 3 years ago
- ☆185Updated 3 years ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆184Updated 2 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆69Updated 3 years ago
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆90Updated 2 years ago
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆154Updated last year
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆132Updated 2 months ago
- ☆86Updated 3 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Updated 3 years ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆190Updated last year
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆151Updated 2 years ago
- ☆194Updated last year
- ☆39Updated 2 years ago
- A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆64Updated 2 years ago
- ☆41Updated 3 years ago
- A new framework for open-vocabulary object detection, based on maskrcnn-benchmark☆247Updated 2 years ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆237Updated 3 years ago
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆24Updated 3 years ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Updated last week
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022☆94Updated last year
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"☆99Updated 2 years ago
- [ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"☆149Updated last year
- ☆219Updated 2 years ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆48Updated last year
- ☆38Updated 3 years ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆68Updated last year
- ☆37Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆68Updated last year
- ☆95Updated 2 years ago