yangli18 / VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆91 Updated last year
Related projects
Alternatives and complementary repositories for VLTVG
- SeqTR: A Simple yet Universal Network for Visual Grounding ☆130 Updated last week
- A lightweight codebase for referring expression comprehension and segmentation ☆52 Updated 2 years ago
- An unofficial PyTorch implementation of "TransVG: End-to-End Visual Grounding with Transformers". ☆51 Updated 3 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" NeurIPS 2021 ☆65 Updated 2 years ago
- ☆163 Updated 8 months ago
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding ☆144 Updated 3 months ago
- ☆34 Updated 2 years ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding. ☆107 Updated 3 months ago
- ☆33 Updated last year
- ☆173 Updated 2 years ago
- [CVPR 2022] This repository is for the paper "DIFNet: Boosting Visual Information Flow for Image Captioning". ☆20 Updated last year
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020 ☆82 Updated 3 years ago
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding. ☆47 Updated 3 years ago
- ☆80 Updated 2 years ago
- ☆19 Updated 7 months ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection ☆54 Updated 2 weeks ago
- Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021 ☆17 Updated 3 years ago
- Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020. ☆59 Updated 3 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" ☆65 Updated 3 years ago
- ☆89 Updated last year
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding ☆56 Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023 ☆35 Updated 9 months ago
- Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR2023) ☆36 Updated last year
- ☆183 Updated last year
- [AAAI2023] Repo for the paper "End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation". ☆22 Updated last year
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation" ☆82 Updated last year
- [CVPR 2023] Official PyTorch code for paper "Prototype-based Embedding Network for Scene Graph Generation" ☆46 Updated last year
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding ☆45 Updated 8 months ago
- Referring Video Object Segmentation / Multi-Object Tracking Repo ☆87 Updated last year