ChenDelong1999 / ITRA

A codebase for flexible and efficient Image Text Representation Alignment

☆19

Alternatives and similar repositories for ITRA

Users who are interested in ITRA are comparing it to the libraries listed below.

Zjut-MultimediaPlus / PIR-pytorch
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆16 Updated last year
jaychempan / PIR-CLIP
📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”
☆18 Updated 9 months ago
ZhanYang-nwpu / PE-RSITR
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023
☆26 Updated last year
xiaoyuan1996 / GaLR
Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"
☆68 Updated last year
TangXu-Group / Cross-modal-remote-sensing-image-and-text-retrieval-models
☆23 Updated 10 months ago
like413 / OPT-RSVG
[TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.
☆39 Updated last month
jaychempan / Awesome-RSITR
A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)｜ Remote Sensing Cross-Model Retrieval (RSCM…
☆57 Updated 4 months ago
lx709 / VRSBench
☆55 Updated 2 months ago
LANMNG / LQVG
☆21 Updated 11 months ago
LanCole / Awesome-Remote-Sensing-Cross-Modal-Image-Text-Retrieval
A collection of papers, datasets, benchmarks, code, and model weights for Remote Sensing Cross-Modal Image-Text Retrieval (RSCMIT).
☆20 Updated this week
ZhanYang-nwpu / RSVG-pytorch
RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022
☆145 Updated last year
mainaksingha01 / APPLeNet
☆22 Updated 11 months ago
om-ai-lab / awesome-RSVLM
Collection of Remote Sensing Vision-Language Models
☆138 Updated last year
spectralpublic / RSIVQA
Data set for the IEEE TGRS paper "Mutual Attention Inception Network for Remote Sensing Visual Question Answering"
☆21 Updated 2 years ago
HaiyanHuang98 / NWPU-Captions
☆16 Updated 2 years ago
GeoX-Lab / RS-GPT4V
☆36 Updated last year
yangcong356 / BITA
This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"
☆32 Updated 7 months ago
om-ai-lab / RS5M
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
☆270 Updated 4 months ago
ZhanYang-nwpu / SkyEyeGPT
[ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
☆89 Updated 2 months ago
linhuixiao / HiVG
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆53 Updated 3 months ago
ZhangWeihang99 / HVSA
Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.
☆16 Updated 11 months ago
Lsan2401 / RMSIN
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
☆130 Updated last year
yangcong356 / KCFI
This is the official code for "Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning"
☆14 Updated last week
yecy749 / GSNet
Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"
☆59 Updated 7 months ago
NJU-LHRS / ScoreRS
Code and updates for the ScoreRS project.
☆25 Updated 5 months ago
VisionXLab / CastDet
[ECCV'24] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning"
☆49 Updated 2 months ago
seekerhuang / HarMA
[ICLRW 2024] Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment
☆54 Updated last year
BigData-KSU / RS-LLaVA
☆52 Updated last year
caoql98 / OVRS
Open-Vocabulary High-Resolution Remote Sensing Image Semantic Segmentation
☆20 Updated 4 months ago
opendatalab / VHM
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
☆92 Updated 5 months ago