Seth-Park/RobustChangeCaptioning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Seth-Park/RobustChangeCaptioning)

Seth-Park / RobustChangeCaptioning

Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)

☆52

Alternatives and similar repositories for RobustChangeCaptioning

Users that are interested in RobustChangeCaptioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tuyunbin / SRDRL
View on GitHub
[ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".
☆13Jan 16, 2022Updated 4 years ago
yaolinli / IDC
View on GitHub
☆30Oct 19, 2022Updated 3 years ago
sushizixin / CLIP4IDC
View on GitHub
CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)
☆36Nov 12, 2022Updated 3 years ago
harsh19 / spot-the-diff
View on GitHub
EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick.
☆69Jan 27, 2026Updated 5 months ago
ShizhenChang / Chg2Cap
View on GitHub
Changes to Captions: An Attentive Network for Remote Sensing Change Captioning
☆80Oct 26, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
yili-19 / SSGPA
View on GitHub
☆17Jul 14, 2025Updated last year
YZHJessica / CDVQA
View on GitHub
☆14Feb 17, 2023Updated 3 years ago
kevendai / fandp-ijcai2025-issues
View on GitHub
☆17Oct 13, 2025Updated 9 months ago
120343 / modified
View on GitHub
modified datasets for remote sensing image caption
☆12Apr 23, 2019Updated 7 years ago
feizc / PNAIC
View on GitHub
Partially Non-Autoregressive Image Captioning
☆10Sep 30, 2021Updated 4 years ago
rabiulcste / vismin
View on GitHub
[NeurIPS24] VisMin: Visual Minimal-Change Understanding
☆19Mar 3, 2025Updated last year
airsplay / VisualRelationships
View on GitHub
Data of ACL 2019 Paper "Expressing Visual Relationships via Language".
☆63Sep 30, 2020Updated 5 years ago
Chen-Yang-Liu / MLAT
View on GitHub
[IEEE GRSL 2022 🔥] "Remote Sensing Image Captioning Based on Multi-Layer Aggregated Transformer"
☆32Jun 20, 2023Updated 3 years ago
HaiyanHuang98 / NWPU-Captions
View on GitHub
☆18Dec 7, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
mlii0117 / CoFE
View on GitHub
Official Code for Contrastive Learning with Counterfactual Explanations for Radiology Report Generation (ECCV 2024)
☆18Apr 3, 2025Updated last year
tuyunbin / SCORER
View on GitHub
[ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".
☆20Sep 25, 2025Updated 10 months ago
Sha-Lab / CMHSE
View on GitHub
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16Apr 22, 2019Updated 7 years ago
ruotianluo / coco-caption
View on GitHub
☆67Nov 11, 2022Updated 3 years ago
mrwu-mac / DIFNet
View on GitHub
[CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .
☆21Nov 28, 2022Updated 3 years ago
zhangxuying1004 / RSTNet
View on GitHub
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
☆123Dec 17, 2022Updated 3 years ago
marcopede / AreasOfAttention
View on GitHub
☆10Apr 20, 2018Updated 8 years ago
hitachi-nlp / FLD-corpus
View on GitHub
☆19Dec 6, 2024Updated last year
jacobswan1 / ViTCAP
View on GitHub
Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".
☆43May 28, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hongwang600 / fashion-iq-metadata
View on GitHub
this repo contains some useful metadata for Fashion IQ challenge: https://sites.google.com/view/lingir/fashion-iq
☆15Jun 28, 2019Updated 7 years ago
Chen-Yang-Liu / Change-Agent
View on GitHub
[IEEE TGRS 2024 🔥] Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis
☆198Jul 27, 2025Updated 11 months ago
LibertFan / ImageCaption
View on GitHub
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019
☆17Sep 8, 2019Updated 6 years ago
techmn / cdchat
View on GitHub
A Large Multimodal Model for Remote Sensing Change Description (IGARSS 2025)
☆22Dec 17, 2025Updated 7 months ago
hanlinwu / ChangeChat
View on GitHub
AN INTERACTIVE REMOTE SENSING CHANGE ANALYSIS MODEL BASED ON MULTIMODAL INSTRUCTION TUNING
☆24Jun 16, 2025Updated last year
LuoweiZhou / coco-caption
View on GitHub
kdexd/coco-caption@de6f385
☆26Apr 21, 2020Updated 6 years ago
husthuaan / AAT
View on GitHub
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Dec 18, 2019Updated 6 years ago
forence / Awesome-Visual-Captioning
View on GitHub
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
☆410Nov 14, 2022Updated 3 years ago
princetonvisualai / imagecaptioning-bias
View on GitHub
Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"
☆12Mar 26, 2026Updated 3 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
PLAN-Lab / CheXRelFormer
View on GitHub
Hierarchical Vision Transformers for Disease Progression Detection in Chest X-Ray Images
☆11Jan 11, 2024Updated 2 years ago
yfyuan01 / MultiturnFashionRetrieval
View on GitHub
SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback
☆14Oct 17, 2022Updated 3 years ago
UCSB-AI / CPL
View on GitHub
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆35Dec 5, 2022Updated 3 years ago
haamoon / finding_common_object
View on GitHub
Learning to Find Common Objects Across Few Image Collections
☆15Dec 11, 2020Updated 5 years ago
LunarShen / DsicoVLA
View on GitHub
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
☆22Jun 23, 2025Updated last year
aimagelab / meshed-memory-transformer
View on GitHub
Meshed-Memory Transformer for Image Captioning. CVPR 2020
☆546Dec 21, 2022Updated 3 years ago
Chen-Yang-Liu / PromptCC
View on GitHub
[IEEE TGRS 2023 🔥] A Decoupling Paradigm With Prompt Learning for Remote Sensing Image Change Captioning'
☆39Jul 9, 2026Updated 2 weeks ago