yaolinli/IDC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yaolinli/IDC)

yaolinli / IDC

☆30

Alternatives and similar repositories for IDC

Users that are interested in IDC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Seth-Park / RobustChangeCaptioning
View on GitHub
Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)
☆52Dec 8, 2022Updated 3 years ago
sushizixin / CLIP4IDC
View on GitHub
CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)
☆36Nov 12, 2022Updated 3 years ago
tuyunbin / SRDRL
View on GitHub
[ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".
☆13Jan 16, 2022Updated 4 years ago
cvpaperchallenge / Describing-and-Localizing-Multiple-Change-with-Transformers
View on GitHub
☆20Nov 10, 2022Updated 3 years ago
ShizhenChang / Chg2Cap
View on GitHub
Changes to Captions: An Attentive Network for Remote Sensing Change Captioning
☆80Oct 26, 2023Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
SjokerLily / awesome-image-captioning
View on GitHub
A paper list of image captioning.
☆21Apr 23, 2022Updated 4 years ago
tuyunbin / SCORER
View on GitHub
[ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".
☆20Sep 25, 2025Updated 9 months ago
YZHJessica / CDVQA
View on GitHub
☆14Feb 17, 2023Updated 3 years ago
xmu-xiaoma666 / SDATR
View on GitHub
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
☆19Oct 15, 2022Updated 3 years ago
mrwu-mac / DIFNet
View on GitHub
[CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .
☆21Nov 28, 2022Updated 3 years ago
LibertFan / ImageCaption
View on GitHub
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019
☆17Sep 8, 2019Updated 6 years ago
uestc-xyh / ComqueryFormer
View on GitHub
☆11Nov 28, 2022Updated 3 years ago
UKPLab / emnlp2022-missing-counter-evidence
View on GitHub
Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…
☆10Jun 21, 2023Updated 3 years ago
husthuaan / AAT
View on GitHub
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Dec 18, 2019Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
md-mohaiminul / BIMBA
View on GitHub
☆29Jul 25, 2025Updated 11 months ago
daicoolb / Awesome-Video-Captioning
View on GitHub
video captioning
☆24Mar 14, 2019Updated 7 years ago
luo3300612 / Transformer-Captioning
View on GitHub
Optimized code based on M2 for faster image captioning training
☆21Nov 18, 2022Updated 3 years ago
AlonMendelson / SGVL
View on GitHub
☆17Dec 13, 2023Updated 2 years ago
Holipori / EKAID
View on GitHub
code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering
☆29May 30, 2025Updated last year
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago
zhangxuying1004 / RSTNet
View on GitHub
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
☆123Dec 17, 2022Updated 3 years ago
justchenhao / SILI_CD
View on GitHub
Official Pytorch Implementation of “Continuous Cross-resolution Remote Sensing Image Change Detection”
☆35Nov 26, 2023Updated 2 years ago
TencentARC / FLM
View on GitHub
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆31May 15, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sks3i / pycocoevalcap
View on GitHub
Microsoft COCO Caption Evaluation Tool - Python 3
☆32May 23, 2019Updated 7 years ago
AndresPMD / semantic_adaptive_margin
View on GitHub
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16Dec 10, 2021Updated 4 years ago
lancopku / simNet
View on GitHub
Code for "simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions" （EMNLP 2018）
☆36Sep 5, 2018Updated 7 years ago
uzh-dqbm-cmi / ARGON
View on GitHub
Progressive Transformer-Based Generation of Radiology Reports
☆25Jan 5, 2025Updated last year
Code-kunkun / ZS-CIR
View on GitHub
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
☆55Nov 26, 2024Updated last year
czbiohub-sf / Organelle_IP_analyses_and_figures
View on GitHub
Jupyter notebooks for analysis and figures related to the native organelle IP paper
☆14Mar 10, 2026Updated 4 months ago
jamespark3922 / lsmdc-baseline
View on GitHub
☆15Aug 16, 2019Updated 6 years ago
AIM3-RUC / Youmakeup_Challenge2022
View on GitHub
☆17Jun 15, 2022Updated 4 years ago
airsplay / VisualRelationships
View on GitHub
Data of ACL 2019 Paper "Expressing Visual Relationships via Language".
☆63Sep 30, 2020Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
wangyuchi369 / RICO
View on GitHub
Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…
☆21Dec 9, 2025Updated 7 months ago
md-mohaiminul / VideoRecap
View on GitHub
☆208Jul 12, 2024Updated 2 years ago
luo3300612 / image-captioning-DLCT
View on GitHub
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
☆203Jun 8, 2022Updated 4 years ago
cs-jerhuang / P-VQA
View on GitHub
Medical Knowledge-Based Network For Patient-oriented Visual Question Answering
☆19Feb 25, 2023Updated 3 years ago
liusiqi43 / tf-mixer
View on GitHub
tensorflow Implementation of https://github.com/facebookresearch/MIXER
☆11Mar 8, 2017Updated 9 years ago
evanmiltenburg / MeasureDiversity
View on GitHub
Measure the diversity of image descriptions, repository for our COLING 2018 paper.
☆13Dec 29, 2019Updated 6 years ago
mlii0117 / DCL
View on GitHub
Official code for "Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation" (CVPR 2023)
☆120May 7, 2023Updated 3 years ago