linhuixiao/CLIP-VG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/linhuixiao/CLIP-VG)

linhuixiao / CLIP-VG

[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.

☆134

Alternatives and similar repositories for CLIP-VG

Users that are interested in CLIP-VG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

linhuixiao / HiVG
View on GitHub
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆63Nov 10, 2025Updated 6 months ago
MightXiong / FedMIT
View on GitHub
☆13Mar 14, 2025Updated last year
chenwei746 / EEVG
View on GitHub
☆23Aug 20, 2024Updated last year
LANMNG / LQVG
View on GitHub
☆32Nov 27, 2025Updated 5 months ago
linhuixiao / OneRef
View on GitHub
[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆31Nov 13, 2025Updated 6 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
djiajunustc / TransVG
View on GitHub
☆198Feb 27, 2024Updated 2 years ago
Dmmm1997 / SimVG
View on GitHub
[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
☆103Oct 29, 2025Updated 6 months ago
LukeForeverYoung / QRNet
View on GitHub
☆41Jun 3, 2022Updated 3 years ago
linhuixiao / Awesome-Visual-Grounding
View on GitHub
[TPAMI 2025] Towards Visual Grounding: A Survey
☆312Nov 18, 2025Updated 6 months ago
cv516Buaa / OV-VG
View on GitHub
☆31Mar 25, 2024Updated 2 years ago
YuchenLiu98 / COMM
View on GitHub
Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
☆209Jan 8, 2025Updated last year
kingthreestones / RefCLIP
View on GitHub
☆39Jun 28, 2023Updated 2 years ago
zjh31 / CPL
View on GitHub
☆21Apr 2, 2024Updated 2 years ago
zyang-ur / ReSC
View on GitHub
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆90Sep 30, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
LeapLabTHU / Pseudo-Q
View on GitHub
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆153Jul 13, 2024Updated last year
MengyuanChen21 / CVPR2023-OWTAL
View on GitHub
[CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization
☆12Jul 9, 2024Updated last year
mightyzau / RegionBLIP
View on GitHub
☆59Aug 7, 2023Updated 2 years ago
xuyang-liu16 / V2Drop
View on GitHub
[CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models
☆30Mar 18, 2026Updated 2 months ago
ZhanYang-nwpu / RSVG-pytorch
View on GitHub
RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022
☆175Dec 10, 2025Updated 5 months ago
iQua / M-DGT
View on GitHub
The source code of the CVPR22 paper titled "Multi-Modal Dynamic Graph Transformer for Visual Grounding".
☆22Mar 26, 2022Updated 4 years ago
yangli18 / VLTVG
View on GitHub
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97Dec 2, 2022Updated 3 years ago
TheShadow29 / awesome-grounding
View on GitHub
awesome grounding: A curated list of research papers in visual grounding
☆1,125Sep 21, 2025Updated 8 months ago
guanxiongsun / vfe.pytorch
View on GitHub
Video Feature Enhancement with PyTorch
☆32Nov 28, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
callsys / DynRefer
View on GitHub
[CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
☆59Mar 4, 2025Updated last year
microsoft / FIBER
View on GitHub
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
☆131Oct 10, 2023Updated 2 years ago
liuting20 / DARA
View on GitHub
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
☆23Feb 26, 2025Updated last year
seanzhuh / SeqTR
View on GitHub
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144Oct 30, 2024Updated last year
jefferyZhan / Griffon
View on GitHub
Official repo of Griffon series including v1(ECCV 2024), v2(ICCV 2025), G, and R, and also the RL tool Vision-R1(CVPR 2026).
☆249Apr 17, 2026Updated last month
yyh-rain-song / ReMamber
View on GitHub
ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.
☆45Jul 11, 2024Updated last year
yuhangzang / UPT
View on GitHub
☆61May 2, 2025Updated last year
hwjiang1510 / VQLoC
View on GitHub
(NeurIPS 2023) Open-set visual object query search & localization in long-form videos
☆26Feb 1, 2024Updated 2 years ago
Jiaxuan-Li / EVCap
View on GitHub
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
☆63Apr 8, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zaynmi / seada-vqa
View on GitHub
A pytorch implemetation of data augmentation method for visual question answering
☆21May 25, 2023Updated 2 years ago
uvavision / AMC-grounding
View on GitHub
[CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"
☆19Oct 10, 2023Updated 2 years ago
MengyuanChen21 / ICLR2024-REDL
View on GitHub
[ICLR 2024 Spotlight] R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning
☆43Nov 18, 2024Updated last year
liuting20 / SwimVG
View on GitHub
Transactions on Multimedia (TMM25)
☆21Apr 8, 2025Updated last year
MengyuanChen21 / CVPR2023-CMPAE
View on GitHub
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
☆37Jun 17, 2023Updated 2 years ago
JierunChen / Ref-L4
View on GitHub
Evaluation code for Ref-L4, a new REC benchmark in the LMM era
☆61Dec 28, 2024Updated last year
MCG-NJU / Dynamic-MDETR
View on GitHub
[TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
☆29Sep 11, 2024Updated last year