A collection of papers, datasets, benchmarks, code, and model weights for Remote Sensing Cross-Modal Image-Text Retrieval (RSCMIT).
☆36Mar 1, 2026Updated last week
Alternatives and similar repositories for Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval
Users that are interested in Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval are comparing it to the libraries listed below
Sorting:
- ☆24Sep 19, 2024Updated last year
- Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023☆28Jan 14, 2024Updated 2 years ago
- A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (RSCM…☆66Mar 10, 2025Updated 11 months ago
- ☆12May 3, 2024Updated last year
- ☆30Jul 21, 2025Updated 7 months ago
- Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”☆26Dec 19, 2025Updated 2 months ago
- Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"☆72Oct 25, 2023Updated 2 years ago
- HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval (ICMR 2024)☆20Aug 13, 2024Updated last year
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- ☆28Nov 27, 2025Updated 3 months ago
- Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'☆28Dec 3, 2023Updated 2 years ago
- Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024☆107Jun 26, 2025Updated 8 months ago
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆138Jan 19, 2026Updated last month
- Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20☆29May 26, 2022Updated 3 years ago
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆84Jan 25, 2026Updated last month
- The source code of AMFMN and the dataset RSITMD☆216Oct 25, 2023Updated 2 years ago
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆14Mar 19, 2025Updated 11 months ago
- [ACMMM'23 Oral] Official Code for “A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval”☆46Jan 19, 2024Updated 2 years ago
- ☆10Feb 13, 2025Updated last year
- [JAG 2026] DreamCD: A change-label-free framework for change detection via a weakly conditional semantic diffusion model in optical VHR i…☆21Jan 30, 2026Updated last month
- This repository includes official implementation and model weights of Data-Efficient Multi-Scale Fusion Vision Transformer.☆13Jan 7, 2025Updated last year
- An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].☆14Jul 27, 2024Updated last year
- A Pytorch Dataloader for tif image files that dynamically crops the image.☆13Aug 21, 2020Updated 5 years ago
- some small but usuful scripts that help you with RK35588 or other Rockchips☆10May 17, 2023Updated 2 years ago
- Source code of WSiP model☆12Aug 14, 2022Updated 3 years ago
- This is a PyTorch implementation of the paper IDA-SiamNet: Interactive- and Dynamic-Aware Siamese Network for Building Change Detection☆12Aug 21, 2024Updated last year
- Authors official PyTorch implementation of the "Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estim…☆13Feb 28, 2024Updated 2 years ago
- ☆14Oct 14, 2019Updated 6 years ago
- ☆18Apr 8, 2025Updated 11 months ago
- ☆13Sep 28, 2024Updated last year
- 😎 Awesome lists of papers and codes about Large Vision-Language Models☆13Apr 1, 2024Updated last year
- Source code of ”Ada4DIR: An adaptive model-driven all-in-one image restoration network for remote sensing images”☆14Mar 12, 2025Updated 11 months ago
- Multiform Ensemble Self-Supervised Learning for Few-Shot Remote Sensing Scene Classification☆13Mar 10, 2023Updated 2 years ago
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆519Jun 27, 2024Updated last year
- ☆17Mar 5, 2024Updated 2 years ago
- ☆14Dec 31, 2024Updated last year
- Benchmark Dataset, Env and Agent for DSN Scheduling☆11Mar 3, 2022Updated 4 years ago
- [Neural Networks 2025]Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval☆11Dec 24, 2024Updated last year
- ☆13Nov 26, 2023Updated 2 years ago