tuyunbin / NCTLinks

[IEEE TMM 2023] This is the Pytorch code for our paper "Neighborhood Contrastive Transformer for Change Captioning".

☆12

Alternatives and similar repositories for NCT

Users that are interested in NCT are comparing it to the libraries listed below

Sorting:

yangcong356 / BITA
This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"
☆32Updated 6 months ago
ChenDelong1999 / ITRA
A codebase for flexible and efficient Image Text Representation Alignment
☆19Updated 2 years ago
TangXu-Group / Cross-modal-remote-sensing-image-and-text-retrieval-models
☆23Updated 9 months ago
xiaoyuan1996 / GaLR
Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"
☆67Updated last year
YZHJessica / CDVQA
☆13Updated 2 years ago
HaiyanHuang98 / NWPU-Captions
☆16Updated 2 years ago
like413 / OPT-RSVG
[TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.
☆39Updated last month
jaychempan / Awesome-RSITR
🎮 A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)｜ Remote Sensing Cross-Model Retrieval (R…
☆57Updated 4 months ago
Chen-Yang-Liu / MLAT
[IEEE GRSL 2022 🔥] "Remote Sensing Image Captioning Based on Multi-Layer Aggregated Transformer"
☆28Updated 2 years ago
Paranioar / RCAR
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
☆33Updated last year
jaychempan / PIR-CLIP
📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”
☆18Updated 9 months ago
Zjut-MultimediaPlus / PIR-pytorch
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆16Updated last year
yangcong356 / KCFI
This is the official code for "Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning"
☆14Updated this week
ferjad / I2DFormer
Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification"…
☆21Updated last year
One-paper-luck / MG-Transformer
☆12Updated 8 months ago
LANMNG / LQVG
☆20Updated 10 months ago
zhuduowang / Change3D
[CVPR 2025 Highlight] Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective.
☆34Updated 2 weeks ago
lerogo / aaai24_itr_cusa
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
☆44Updated last year
lsa1997 / CARIS
Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]
☆27Updated 7 months ago
mainaksingha01 / APPLeNet
☆22Updated 10 months ago
ZhanYang-nwpu / RSVG-pytorch
RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022
☆144Updated last year
caoql98 / OVRS
Open-Vocabulary High-Resolution Remote Sensing Image Semantic Segmentation
☆19Updated 4 months ago
linhuixiao / HiVG
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆51Updated 3 months ago
yaolinli / IDC
☆28Updated 2 years ago
yangli18 / VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97Updated 2 years ago
WayneTomas / TransCP
[TPAMI 2024] This is the Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding".
☆18Updated 2 months ago
hiteshK03 / Remote-sensing-image-captioning-with-transformer-and-multilabel-classification
☆18Updated 2 years ago
CrossmodalGroup / NAAF
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆118Updated 2 years ago
kingthreestones / RefCLIP
☆36Updated 2 years ago
GeoX-Lab / RS-GPT4V
☆36Updated last year