djiajunustc/TransVG

djiajunustc / TransVG

☆198

Alternatives and similar repositories for TransVG

Users who are interested in TransVG are comparing it to the repositories listed below.

nku-shengzheliu / Pytorch-TransVG
An unofficial PyTorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".
☆51 · Jun 7, 2021 · Updated 5 years ago
zyang-ur / ReSC
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆91 · Sep 30, 2021 · Updated 4 years ago
yangli18 / VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97 · Dec 2, 2022 · Updated 3 years ago
LukeForeverYoung / QRNet
☆41 · Jun 3, 2022 · Updated 4 years ago
djiajunustc / H-23D_R-CNN
☆65 · Aug 11, 2021 · Updated 4 years ago
TheShadow29 / awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
☆1,127 · Sep 21, 2025 · Updated 10 months ago
zyang-ur / onestage_grounding
A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)
☆151 · Nov 18, 2020 · Updated 5 years ago
ubc-vision / RefTR
Official implementation of the paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding", NeurIPS 2021
☆67 · May 26, 2022 · Updated 4 years ago
seanzhuh / SeqTR
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144 · Oct 30, 2024 · Updated last year
LeapLabTHU / Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆153 · Jul 13, 2024 · Updated 2 years ago
LANMNG / LQVG
☆32 · Nov 27, 2025 · Updated 7 months ago
linhuixiao / CLIP-VG
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
☆135 · Nov 10, 2025 · Updated 8 months ago
lichengunc / refer
Referring Expression Datasets API
☆573 · Aug 27, 2024 · Updated last year
ChopinSharp / ref-nms
Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"
☆22 · Dec 20, 2020 · Updated 5 years ago
Dmmm1997 / SimVG
[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
☆103 · Oct 29, 2025 · Updated 8 months ago
lichengunc / MAttNet
MAttNet: Modular Attention Network for Referring Expression Comprehension
☆299 · Nov 29, 2022 · Updated 3 years ago
like413 / OPT-RSVG
[TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.
☆56 · Jun 10, 2025 · Updated last year
BigRedT / info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
☆74 · Aug 22, 2020 · Updated 5 years ago
iQua / M-DGT
The source code of the CVPR22 paper titled "Multi-Modal Dynamic Graph Transformer for Visual Grounding".
☆22 · Mar 26, 2022 · Updated 4 years ago
svip-lab / LBYLNet
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
☆51 · Aug 31, 2021 · Updated 4 years ago
BryanPlummer / flickr30k_entities
Flickr30K Entities Dataset
☆185 · Dec 23, 2018 · Updated 7 years ago
ashkamath / mdetr
☆1,050 · Oct 3, 2022 · Updated 3 years ago
ZhanYang-nwpu / RSVG-pytorch
RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022
☆178 · Dec 10, 2025 · Updated 7 months ago
chihhuiho / yoro
☆16 · Nov 14, 2022 · Updated 3 years ago
qinzzz / Multimodal-Alignment-Framework
Implementation for MAF: Multimodal Alignment Framework
☆46 · Nov 25, 2020 · Updated 5 years ago
daqingliu / awesome-rec
A curated list of research papers in Referring Expression Comprehension (REC)
☆47 · May 13, 2021 · Updated 5 years ago
lichengunc / refer-parser2
Referring Expression Parser
☆27 · Feb 10, 2018 · Updated 8 years ago
chenwei746 / EEVG
☆23 · Aug 20, 2024 · Updated last year
Deanplayerljx / tab-vcr
PyTorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19 · May 6, 2021 · Updated 5 years ago
luogen1996 / SimREC
A lightweight codebase for referring expression comprehension and segmentation
☆57 · May 21, 2022 · Updated 4 years ago
TheShadow29 / zsgnet-pytorch
Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071…
☆71 · Apr 22, 2020 · Updated 6 years ago
insomnia94 / DTWREG
Preliminary code for reviewers
☆13 · Mar 30, 2021 · Updated 5 years ago
daqingliu / NMTree
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆39 · Nov 23, 2019 · Updated 6 years ago
insomnia94 / ISREG
Iterative shrinking for referring expression grounding using deep reinforcement learning
☆14 · Nov 27, 2021 · Updated 4 years ago
LouChao98 / VLGAE
Official Implementation for CVPR 2022 paper "Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language …
☆24 · Oct 19, 2022 · Updated 3 years ago
djiajunustc / Voxel-R-CNN
☆299 · Feb 12, 2022 · Updated 4 years ago
allenai / reclip
☆92 · Apr 15, 2022 · Updated 4 years ago
sega-hsj / MVT-3DVG
[CVPR 2022] Multi-View Transformer for 3D Visual Grounding
☆81 · Nov 9, 2022 · Updated 3 years ago
wenz116 / DRFT
End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021
☆18 · Oct 24, 2021 · Updated 4 years ago