sushizixin/CLIP4IDC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sushizixin/CLIP4IDC)

sushizixin / CLIP4IDC

CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)

☆36

Alternatives and similar repositories for CLIP4IDC

Users that are interested in CLIP4IDC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tuyunbin / SCORER
View on GitHub
[ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".
☆20Sep 25, 2025Updated 10 months ago
yaolinli / IDC
View on GitHub
☆30Oct 19, 2022Updated 3 years ago
Seth-Park / RobustChangeCaptioning
View on GitHub
Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)
☆52Dec 8, 2022Updated 3 years ago
tuyunbin / SRDRL
View on GitHub
[ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".
☆13Jan 16, 2022Updated 4 years ago
ShizhenChang / Chg2Cap
View on GitHub
Changes to Captions: An Attentive Network for Remote Sensing Change Captioning
☆80Oct 26, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xmu-xiaoma666 / SDATR
View on GitHub
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
☆19Oct 15, 2022Updated 3 years ago
reallsp / SAF
View on GitHub
☆12Sep 6, 2023Updated 2 years ago
fkxssaa / Deliberate-Attention-Networks-for-Image-Captioning
View on GitHub
Deliberate Attention Networks for Image Captioning (AAAI 2019)
☆11Sep 30, 2019Updated 6 years ago
ezeli / BUTD_model
View on GitHub
A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.
☆48Nov 15, 2021Updated 4 years ago
cvpaperchallenge / Describing-and-Localizing-Multiple-Change-with-Transformers
View on GitHub
☆20Nov 10, 2022Updated 3 years ago
Chen-Yang-Liu / RSICC
View on GitHub
[IEEE TGRS 2022 🔥] Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Method and a Large Scale Dataset
☆141Sep 16, 2025Updated 10 months ago
zhangxuying1004 / RSTNet
View on GitHub
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
☆123Dec 17, 2022Updated 3 years ago
YueJiang-nj / EyeFormer-UIST2024
View on GitHub
Code Release for the paper EyeFormer: Predicting Scanpaths in Free-Viewing Tasks with Transformer-Guided Reinforcement Learning.
☆16Jan 29, 2026Updated 5 months ago
facebookresearch / connect-caption-and-trace
View on GitHub
A unified framework to jointly model images, text, and human attention traces.
☆80May 24, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
N-Almarwani / DCT_Sentence_Embedding
View on GitHub
Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform
☆17Jul 2, 2020Updated 6 years ago
aliborji / ObjectNetReanalysis
View on GitHub
reanalysis of the ObjectNet paper and our annotations and code
☆16Mar 4, 2021Updated 5 years ago
feizc / PNAIC
View on GitHub
Partially Non-Autoregressive Image Captioning
☆10Sep 30, 2021Updated 4 years ago
guanghuixu / AnchorCaptioner
View on GitHub
☆30May 7, 2021Updated 5 years ago
JDAI-CV / image-captioning
View on GitHub
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
☆273Jul 27, 2021Updated 4 years ago
fundamentalvision / UniGrad
View on GitHub
☆31Jun 29, 2022Updated 4 years ago
Flame-Chasers / DiaNA
View on GitHub
【CVPR 2025】Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment
☆39Sep 17, 2025Updated 10 months ago
mesnico / TERN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144
☆58Dec 6, 2023Updated 2 years ago
simonepri / fever-transformers
View on GitHub
📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks
☆12Feb 21, 2020Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
congchao120 / video_synopsis
View on GitHub
Project of video synopsis
☆10May 18, 2016Updated 10 years ago
forence / Awesome-Visual-Captioning
View on GitHub
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
☆410Nov 14, 2022Updated 3 years ago
Qinying-Liu / TagAlign
View on GitHub
Official implementation of TagAlign
☆37Dec 11, 2024Updated last year
MLforHealth / S2SD
View on GitHub
(ICML 2021) Implementation for S2SD - Simultaneous Similarity-based Self-Distillation for Deep Metric Learning. Paper Link: https://arxiv…
☆44Sep 18, 2020Updated 5 years ago
quangvnai / grit
View on GitHub
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
☆199May 9, 2023Updated 3 years ago
OVAD-Benchmark / ovad-benchmark-code
View on GitHub
OVAD: Open-vocabulary Attribute Detection code
☆30Aug 28, 2023Updated 2 years ago
Boomem / geetest
View on GitHub
极验滑块 js 破解
☆13May 24, 2019Updated 7 years ago
aa200647963 / SGG-DHL
View on GitHub
This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.
☆17Aug 6, 2022Updated 3 years ago
soumik12345 / nerf.jax
View on GitHub
A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.
☆13Apr 21, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
marcopede / AreasOfAttention
View on GitHub
☆10Apr 20, 2018Updated 8 years ago
husthuaan / AAT
View on GitHub
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Dec 18, 2019Updated 6 years ago
katesanders9 / squid-e
View on GitHub
☆10Oct 5, 2022Updated 3 years ago
mesnico / TERAN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…
☆74Dec 6, 2023Updated 2 years ago
SY-Xuan / vibe_python
View on GitHub
a implementation of vibe with python
☆11Jul 27, 2018Updated 7 years ago
vsubhashini / noc
View on GitHub
Novel Object Captioner - Captioning Images with diverse objects
☆42Nov 26, 2017Updated 8 years ago
yale-nlp / ODSum
View on GitHub
Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"
☆11Sep 20, 2024Updated last year