yangli18 / VLTVGLinks

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022

☆96

Alternatives and similar repositories for VLTVG

Users that are interested in VLTVG are comparing it to the libraries listed below

Sorting:

luogen1996 / SimREC
A lightweight codebase for referring expression comprehension and segmentation
☆55Updated 3 years ago
seanzhuh / SeqTR
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144Updated last year
djiajunustc / TransVG
☆193Updated last year
LukeForeverYoung / QRNet
☆40Updated 3 years ago
LeapLabTHU / Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆152Updated last year
ubc-vision / RefTR
Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021
☆69Updated 3 years ago
nku-shengzheliu / Pytorch-TransVG
An unofficial pytorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".
☆52Updated 4 years ago
linhuixiao / CLIP-VG
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
☆131Updated last week
dyabel / detpro
☆184Updated 3 years ago
allenai / reclip
☆88Updated 3 years ago
kingthreestones / RefCLIP
☆38Updated 2 years ago
rentainhe / TRAR-VQA
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
☆68Updated 4 years ago
dzh19990407 / LBDT
CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
☆23Updated 3 years ago
zyang-ur / ReSC
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆87Updated 4 years ago
alirezazareian / ovr-cnn
A new framework for open-vocabulary object detection, based on maskrcnn-benchmark
☆247Updated 2 years ago
yz93 / LAVT-RIS
☆215Updated 2 years ago
Artanic30 / HOICLIP
CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models
☆68Updated last year
zjh31 / CPL
☆20Updated last year
wusize / ovdet
[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection
☆183Updated 2 years ago
Charles-Xie / CQL
Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR2023)
☆37Updated 2 years ago
mrwu-mac / DIFNet
[CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .
☆20Updated 2 years ago
sail-sg / ptp
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
☆151Updated 2 years ago
spyflying / CMPC-Refseg
Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.
☆63Updated 4 years ago
TalalWasim / Vita-CLIP
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆127Updated 2 years ago
LutingWang / OADP
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection
☆62Updated last month
guozix / TaI-DPT
☆94Updated 2 years ago
svip-lab / LBYLNet
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
☆49Updated 4 years ago
bladewaltz1 / PromptSwitch
☆30Updated 2 years ago
thunlp / PEVL
Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”
☆48Updated 3 years ago
Scarecrow0 / SGTR
☆97Updated 3 years ago