Dmmm1997/SimVG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Dmmm1997/SimVG)

Dmmm1997 / SimVG

[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion

☆103

Alternatives and similar repositories for SimVG

Users that are interested in SimVG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Dmmm1997 / PropVG
View on GitHub
[ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
☆32Oct 13, 2025Updated 9 months ago
Dmmm1997 / C3VG
View on GitHub
[AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
☆45Jul 2, 2025Updated last year
Dmmm1997 / MomentSeg
View on GitHub
[ECCV2026] MomentSeg: Moment-Centric Sampling for Enhanced Video Pixel Understanding
☆24Jun 19, 2026Updated last month
linhuixiao / HiVG
View on GitHub
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆65Nov 10, 2025Updated 8 months ago
Dmmm1997 / DeRIS
View on GitHub
[ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
☆48Nov 21, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
linhuixiao / OneRef
View on GitHub
[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆32Nov 13, 2025Updated 8 months ago
Dmmm1997 / InstanceVG
View on GitHub
[TPAMI2025] Improving Generalized Visual Grounding with Instance-aware Joint Learning
☆33Apr 28, 2026Updated 2 months ago
chenwei746 / EEVG
View on GitHub
☆23Aug 20, 2024Updated last year
Mr-Bigworth / MMCA
View on GitHub
Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)
☆26Jun 11, 2025Updated last year
like413 / OPT-RSVG
View on GitHub
[TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.
☆56Jun 10, 2025Updated last year
WayneTomas / TransCP
View on GitHub
[TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…
☆28May 8, 2025Updated last year
linhuixiao / CLIP-VG
View on GitHub
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
☆135Nov 10, 2025Updated 8 months ago
om-ai-lab / GroundVLP
View on GitHub
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
☆74Apr 10, 2026Updated 3 months ago
MCG-NJU / Dynamic-MDETR
View on GitHub
[TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
☆29Sep 11, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
djiajunustc / TransVG
View on GitHub
☆198Feb 27, 2024Updated 2 years ago
pumpkin805 / FALIP
View on GitHub
[ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆18Sep 11, 2024Updated last year
linhuixiao / Awesome-Visual-Grounding
View on GitHub
[TPAMI 2025] Towards Visual Grounding: A Survey
☆322Nov 18, 2025Updated 8 months ago
cjhing / OS-FPI
View on GitHub
Official repository of OS-FPI
☆17Dec 22, 2024Updated last year
liuting20 / SwimVG
View on GitHub
Transactions on Multimedia (TMM25)
☆21Apr 8, 2025Updated last year
yifeisu / TG-GAT
View on GitHub
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.
☆21Jan 2, 2024Updated 2 years ago
jcwang0602 / PLVL
View on GitHub
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
☆13May 9, 2025Updated last year
Charles-Xie / awesome-described-object-detection
View on GitHub
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…
☆356Nov 6, 2025Updated 8 months ago
Xuchen-Li / Awesome-Vision-Language-Tracking
View on GitHub
A vision-language tracking paper list, articles related to visual language tracking have been documented.
☆46Dec 15, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WeitaiKang / SegVG
View on GitHub
[ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
☆63Oct 22, 2024Updated last year
MultimodalGeo / GeoText-1652
View on GitHub
An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
☆118Jul 7, 2026Updated 2 weeks ago
OpenSpaceAI / UVLTrack
View on GitHub
The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"
☆50Nov 4, 2024Updated last year
callsys / ControlCap
View on GitHub
[ECCV 2024] ControlCap: Controllable Region-level Captioning
☆81Oct 25, 2024Updated last year
iLearn-Lab / MM23-RTQ
View on GitHub
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
☆15Apr 7, 2026Updated 3 months ago
Dmmm1997 / DenseUAV
View on GitHub
「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments
☆228Dec 12, 2025Updated 7 months ago
ziplab / MPVSS
View on GitHub
☆33Feb 29, 2024Updated 2 years ago
weijun-arc / SPOL
View on GitHub
Codes for CVPR2021 paper "Shallow Feature Matters for Weakly Supervised Object Localization"
☆24Aug 2, 2021Updated 4 years ago
hyqyoung / RAMS-Trans
View on GitHub
RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition
☆11Dec 14, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
qumengxue / RIO
View on GitHub
☆13Oct 30, 2023Updated 2 years ago
shikras / d-cube
View on GitHub
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating…
☆138Mar 20, 2024Updated 2 years ago
LukeForeverYoung / QRNet
View on GitHub
☆41Jun 3, 2022Updated 4 years ago
LANMNG / LQVG
View on GitHub
☆32Nov 27, 2025Updated 7 months ago
jkli1998 / DRM
View on GitHub
Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024)
☆33Sep 6, 2025Updated 10 months ago
ml-research / deictic-segment-anything
View on GitHub
Segment Anything with Deictic Prompting
☆27May 13, 2025Updated last year
callsys / DynRefer
View on GitHub
[CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
☆59Mar 4, 2025Updated last year