Zjut-MultimediaPlus / PIR-pytorchLinks

A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)

☆16

Alternatives and similar repositories for PIR-pytorch

Users that are interested in PIR-pytorch are comparing it to the libraries listed below

Sorting:

jaychempan / PIR-CLIP
📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”
☆18Updated 9 months ago
jaychempan / Awesome-RSITR
A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)｜ Remote Sensing Cross-Model Retrieval (RSCM…
☆57Updated 4 months ago
ZhanYang-nwpu / PE-RSITR
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023
☆26Updated last year
LanCole / Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval
A collection of papers, datasets, benchmarks, code, and model weights for Remote Sensing Cross-Modal Image-Text Retrieval (RSCMIT).
☆20Updated this week
TangXu-Group / Cross-modal-remote-sensing-image-and-text-retrieval-models
☆23Updated 10 months ago
ChenDelong1999 / ITRA
A codebase for flexible and efficient Image Text Representation Alignment
☆19Updated 2 years ago
like413 / OPT-RSVG
[TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.
☆39Updated last month
ZhangWeihang99 / HVSA
Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.
☆16Updated 11 months ago
LANMNG / LQVG
☆21Updated 11 months ago
linhuixiao / HiVG
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆53Updated 3 months ago
xiaoyuan1996 / GaLR
Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"
☆68Updated last year
lx709 / VRSBench
☆55Updated 2 months ago
yangcong356 / BITA
This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"
☆32Updated 7 months ago
ZhanYang-nwpu / RSVG-pytorch
RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022
☆145Updated last year
HaiyanHuang98 / NWPU-Captions
☆16Updated 2 years ago
opendatalab / VHM
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
☆92Updated 5 months ago
om-ai-lab / RS5M
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
☆270Updated 4 months ago
ferjad / I2DFormer
Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification"…
☆21Updated 2 years ago
xiaoyuan1996 / SemanticLocalizationMetrics
The first research for semantic localization
☆29Updated last year
om-ai-lab / awesome-RSVLM
Collection of Remote Sensing Vision-Language Models
☆138Updated last year
xuliu-cyber / RSUniVLM
☆31Updated 7 months ago
ZhanYang-nwpu / SkyEyeGPT
[ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
☆89Updated 2 months ago
chenwei746 / EEVG
☆20Updated 11 months ago
GeoX-Lab / RS-GPT4V
☆36Updated last year
Lsan2401 / RMSIN
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
☆130Updated last year
caoql98 / OVRS
Open-Vocabulary High-Resolution Remote Sensing Image Semantic Segmentation
☆20Updated 4 months ago
yecy749 / GSNet
Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"
☆59Updated 7 months ago
Luo-Z13 / SkySenseGPT
A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding
☆115Updated this week
YZHJessica / CDVQA
☆13Updated 2 years ago
BigData-KSU / RS-LLaVA
☆52Updated last year