like413/OPT-RSVG



[TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.

☆56

Alternatives and similar repositories for OPT-RSVG

Users who are interested in OPT-RSVG are comparing it to the repositories listed below.


ZhanYang-nwpu / RSVG-pytorch
RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022
☆178 · Dec 10, 2025 · Updated 7 months ago
LANMNG / LQVG
☆32 · Nov 27, 2025 · Updated 7 months ago
like413 / MACN
[TGRS 2023] Mixing Self-Attention and Convolution Network: A Unified Framework for Multisource Remote Sensing Data Classification.
☆14 · Mar 28, 2024 · Updated 2 years ago
xiaoqiang-lu / Research
☆19 · Jun 6, 2025 · Updated last year
like413 / RSVG-ZeroOV
[AAAI 2026] RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images.
☆24 · Nov 11, 2025 · Updated 8 months ago
linhuixiao / HiVG
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆65 · Nov 10, 2025 · Updated 8 months ago
like413 / VisTA
[arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
☆36 · Jul 2, 2025 · Updated last year
VisionXLab / GeoGround
GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding
☆92 · May 10, 2025 · Updated last year
Dmmm1997 / SimVG
[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
☆103 · Oct 29, 2025 · Updated 8 months ago
1e12Leon / RemoteAgent
[arXiv 26] RemoteAgent: Bridging Vague Human Intents and Earth Observation with RL-based Agentic MLLMs
☆15 · Jul 5, 2026 · Updated 2 weeks ago
WayneTomas / TransCP
[TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…
☆28 · May 8, 2025 · Updated last year
djiajunustc / TransVG
☆198 · Feb 27, 2024 · Updated 2 years ago
Lsan2401 / RMSIN
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
☆159 · Apr 1, 2024 · Updated 2 years ago
like413 / SFS-Conv
[CVPR 2024] Unleashing Channel Potential: Space-Frequency Selection Convolution for SAR Object Detection.
☆62 · Sep 20, 2024 · Updated last year
zhu-xlab / rrsis
☆22 · Jul 15, 2024 · Updated 2 years ago
om-ai-lab / RS5M
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
☆312 · Mar 17, 2025 · Updated last year
BigData-KSU / RS-LLaVA
☆66 · Oct 21, 2025 · Updated 9 months ago
wivizhang / EarthMarker
☆46 · Jan 6, 2025 · Updated last year
HAWLYQ / ET-Cap
☆24 · Oct 8, 2023 · Updated 2 years ago
sunzc-sunny / refdrone
RefDrone: A Challenging Benchmark for Drone Scene Referring Expression Comprehension
☆44 · Jul 8, 2026 · Updated 2 weeks ago
GeoX-Lab / RS-GPT4V
☆37 · Jul 1, 2024 · Updated 2 years ago
qzp2018 / MCLN
This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…
☆27 · Oct 10, 2024 · Updated last year
ChenDelong1999 / RemoteCLIP
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
☆575 · Jun 27, 2024 · Updated 2 years ago
om-ai-lab / awesome-RSVLM
Collection of Remote Sensing Vision-Language Models
☆142 · May 13, 2024 · Updated 2 years ago
jaychempan / LAE-DINO
[AAAI'25] Official Code for "Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
☆276 · Jun 6, 2026 · Updated last month
Zjut-MultimediaPlus / PIR-pytorch
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆15 · Dec 8, 2023 · Updated 2 years ago
Shaosifan / FIANet
[IEEE TGRS 2025] Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation
☆45 · Nov 14, 2025 · Updated 8 months ago
ViTAE-Transformer / MTP
The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
☆249 · Aug 4, 2025 · Updated 11 months ago
floatingstarZ / OpenRSD
OpenRSD: Towards Open-Prompt Object Detection in Remote Sensing
☆39 · Jan 7, 2026 · Updated 6 months ago
Huntersxsx / RIS-Learning-List
Related papers about Referring Image Segmentation (RIS)
☆16 · Dec 26, 2023 · Updated 2 years ago
VisionXLab / RSCoVLM
[Remote Sensing 2026] Co-Training Vision Language Models for Remote Sensing Multi-task Learning
☆37 · Jul 8, 2026 · Updated 2 weeks ago
minglangL / RSThinker
☆38 · May 26, 2026 · Updated last month
mbzuai-oryx / GeoChat
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
☆732 · Nov 28, 2024 · Updated last year
xiaoyuan1996 / SemanticLocalizationMetrics
The first research for semantic localization
☆27 · Dec 6, 2023 · Updated 2 years ago
XiaoxFeng / RINet
Codes for Weakly Supervised Rotation-Invariant Aerial Object Detection Network
☆32 · May 30, 2023 · Updated 3 years ago
Yxxxb / LAVT-RS
[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆26 · Jan 21, 2025 · Updated last year
yangli18 / VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97 · Dec 2, 2022 · Updated 3 years ago
lx709 / VRSBench
☆69 · Jun 11, 2026 · Updated last month
linhuixiao / Awesome-Visual-Grounding
[TPAMI 2025] Towards Visual Grounding: A Survey
☆322 · Nov 18, 2025 · Updated 8 months ago