LanCole/Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LanCole/Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval)

LanCole / Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval

A collection of papers, datasets, benchmarks, code, and model weights for Remote Sensing Cross-Modal Image-Text Retrieval (RSCMIT).

☆39

Alternatives and similar repositories for Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval

Users that are interested in Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jaychempan / Awesome-RSITR
View on GitHub
A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)｜ Remote Sensing Cross-Model Retrieval (RSCM…
☆69Mar 10, 2025Updated last year
ZhanYang-nwpu / PE-RSITR
View on GitHub
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023
☆29Jan 14, 2024Updated 2 years ago
jaychempan / PriorCLIP
View on GitHub
Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”
☆30Dec 19, 2025Updated 7 months ago
TangXu-Group / Cross-modal-remote-sensing-image-and-text-retrieval-models
View on GitHub
☆22Sep 19, 2024Updated last year
Ji-Haoyang / FGVLA
View on GitHub
The code of Fine-Grained Visual-Language Alignment for Remote Sensing Image-Text Retrieval（IEEE Transactions on Geoscience and Remote Sen…
☆15Jun 30, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
CrossmodalGroup / ESL
View on GitHub
☆12May 3, 2024Updated 2 years ago
jaychempan / PIR
View on GitHub
[ACMMM'23 Oral] Official Code for “A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval”
☆51Jan 19, 2024Updated 2 years ago
xiaoyuan1996 / GaLR
View on GitHub
Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"
☆70Oct 25, 2023Updated 2 years ago
seekerhuang / HarMA
View on GitHub
[ICLRW 2024] Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment
☆64Jul 18, 2024Updated 2 years ago
ZhangWeihang99 / HVSA
View on GitHub
Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.
☆16Aug 10, 2024Updated last year
ChenDelong1999 / RemoteCLIP
View on GitHub
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
☆579Jun 27, 2024Updated 2 years ago
AAA-Zheng / Image-Text-Matching-Summary
View on GitHub
Summary of Related Research on Image-Text Matching
☆75May 20, 2023Updated 3 years ago
CrossmodalGroup / LAPS
View on GitHub
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024
☆110Jun 26, 2025Updated last year
MediaBrain-SJTU / GSC
View on GitHub
☆14Jul 13, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
MitsuiChen14 / DGTRS
View on GitHub
☆32Jun 10, 2026Updated last month
Zjut-MultimediaPlus / PIR-pytorch
View on GitHub
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆15Dec 8, 2023Updated 2 years ago
LANMNG / LQVG
View on GitHub
☆32Nov 27, 2025Updated 8 months ago
kkzhang95 / Awesome-Composed-Multi-modal-Retrieval
View on GitHub
A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…
☆90Jan 20, 2026Updated 6 months ago
om-ai-lab / RS5M
View on GitHub
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
☆313Mar 17, 2025Updated last year
XLearning-SCU / 2024-TIP-CREAM
View on GitHub
PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)
☆22Mar 25, 2024Updated 2 years ago
multimodal-interpretability / nnn
View on GitHub
Nearest Neighbor Normalization (EMNLP 2024)
☆21Nov 1, 2024Updated last year
yangcong356 / BITA
View on GitHub
This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"
☆36Dec 24, 2024Updated last year
liuliqin / R2HGAN-generate-HSI-from-RGB
View on GitHub
☆12Mar 31, 2022Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
CrossmodalGroup / ER-SAN
View on GitHub
Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.
☆25Aug 5, 2023Updated 2 years ago
LuminosityX / FNE
View on GitHub
Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..
☆20Dec 3, 2023Updated 2 years ago
caoyuan57 / Hashing
View on GitHub
☆78May 26, 2025Updated last year
qxzha / UGNCL
View on GitHub
Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching (ACM SIGIR 2024, Pytorch Code)
☆22Apr 16, 2026Updated 3 months ago
shivram1987 / VisionTransformerHashing
View on GitHub
☆42Mar 23, 2022Updated 4 years ago
Luo-Z13 / SkySense-Chat
View on GitHub
A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model
☆148Jan 19, 2026Updated 6 months ago
zkashef / ECE535-FederatedLearning
View on GitHub
Multimodal Federated Learning on IoT Data
☆11Dec 17, 2023Updated 2 years ago
OMEGAFSL / MESSL
View on GitHub
Multiform Ensemble Self-Supervised Learning for Few-Shot Remote Sensing Scene Classification
☆13Mar 10, 2023Updated 3 years ago
Lavender105 / RSGPT
View on GitHub
☆150May 27, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
BigData-KSU / RS-LLaVA
View on GitHub
☆66Oct 21, 2025Updated 9 months ago
201528014227051 / RSICD_optimal
View on GitHub
Datasets for remote sensing images (Paper:Exploring Models and Data for Remote Sensing Image Caption Generation)
☆239Nov 28, 2021Updated 4 years ago
rui-ye / FedFM
View on GitHub
☆11Dec 4, 2025Updated 7 months ago
gentlefress / MLIP
View on GitHub
The code of paper "MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning" accep…
☆10Mar 5, 2024Updated 2 years ago
KyanChen / DynamicVis
View on GitHub
This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"
☆86Jan 25, 2026Updated 6 months ago
om-ai-lab / ImageRAG
View on GitHub
Enhancing Ultrahigh Resolution Remote Sensing Imagery Analysis With ImageRAG [GRSM]
☆34May 16, 2026Updated 2 months ago
RS-xjg / oil-spill-detection
View on GitHub
Oil Spill Detection Based on Deep Convolution Neural Network using Polarimetric Scattering Information from Sentinel-1 SAR Images
☆12May 13, 2021Updated 5 years ago