Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97Dec 2, 2022Updated 3 years ago
Alternatives and similar repositories for VLTVG
Users that are interested in VLTVG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆41Jun 3, 2022Updated 3 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆90Sep 30, 2021Updated 4 years ago
- ☆198Feb 27, 2024Updated 2 years ago
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.☆51Aug 31, 2021Updated 4 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆67May 26, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021☆28Oct 9, 2021Updated 4 years ago
- [CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"☆19Oct 10, 2023Updated 2 years ago
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆153Jul 13, 2024Updated last year
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- awesome grounding: A curated list of research papers in visual grounding☆1,124Sep 21, 2025Updated 7 months ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Oct 30, 2024Updated last year
- ☆16Nov 14, 2022Updated 3 years ago
- ☆91Apr 15, 2022Updated 4 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆59Nov 28, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"☆22Dec 20, 2020Updated 5 years ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 7 months ago
- ☆233Apr 13, 2023Updated 3 years ago
- A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)☆151Nov 18, 2020Updated 5 years ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆29Nov 28, 2024Updated last year
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆74Aug 22, 2020Updated 5 years ago
- ☆23Aug 20, 2024Updated last year
- Phrase Localization Evaluation Toolkit☆20Aug 16, 2019Updated 6 years ago
- Progressive Language-guided Visual Learning for Multi-Task Visual Grounding☆13May 9, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR2024] Mask Grounding for Referring Image Segmentation☆28Jul 22, 2024Updated last year
- ☆39Jun 28, 2023Updated 2 years ago
- The source code of the CVPR22 paper titled "Multi-Modal Dynamic Graph Transformer for Visual Grounding".☆22Mar 26, 2022Updated 4 years ago
- [CVPR2022] Official Implementation of ReferFormer☆352Feb 15, 2025Updated last year
- Lightweight Transformer for Multi-modal Tasks☆16Dec 9, 2022Updated 3 years ago
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆45Jul 11, 2024Updated last year
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆24Aug 12, 2022Updated 3 years ago
- Preliminary code for reviewers☆13Mar 30, 2021Updated 5 years ago
- An unofficial pytorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".☆51Jun 7, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for "Towards Effective Visual Representations for Partial-Label Learning" in CVPR 2023.☆24Dec 12, 2023Updated 2 years ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆134Nov 10, 2025Updated 5 months ago
- ☆64May 17, 2023Updated 2 years ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆53Jun 10, 2025Updated 10 months ago
- [TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…☆28May 8, 2025Updated 11 months ago
- Flickr30K Entities Dataset☆184Dec 23, 2018Updated 7 years ago
- ☆1,046Oct 3, 2022Updated 3 years ago