CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)
☆36 · Nov 12, 2022 · Updated 3 years ago
Alternatives and similar repositories for CLIP4IDC
Users that are interested in CLIP4IDC are comparing it to the libraries listed below.
- [IEEE TMM 2023] This is the Pytorch code for our paper "Neighborhood Contrastive Transformer for Change Captioning". ☆12 · Aug 30, 2023 · Updated 2 years ago
- [ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning". ☆20 · Sep 25, 2025 · Updated 6 months ago
- ☆30 · Oct 19, 2022 · Updated 3 years ago
- Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019) ☆50 · Dec 8, 2022 · Updated 3 years ago
- [ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning". ☆13 · Jan 16, 2022 · Updated 4 years ago
- Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022) ☆19 · Oct 15, 2022 · Updated 3 years ago
- ☆12 · Sep 6, 2023 · Updated 2 years ago
- A paper list of image captioning. ☆21 · Apr 23, 2022 · Updated 3 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019) ☆11 · Sep 30, 2019 · Updated 6 years ago
- modified datasets for remote sensing image caption ☆11 · Apr 23, 2019 · Updated 6 years ago
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning. ☆48 · Nov 15, 2021 · Updated 4 years ago
- ☆20 · Nov 10, 2022 · Updated 3 years ago
- [IEEE TGRS 2022 🔥] Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Method and a Large Scale Dataset ☆138 · Sep 16, 2025 · Updated 6 months ago
- 🔥Collection of resources and papers ☆69 · May 30, 2025 · Updated 9 months ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language". ☆63 · Sep 30, 2020 · Updated 5 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021) ☆123 · Dec 17, 2022 · Updated 3 years ago
- Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019 ☆17 · Sep 8, 2019 · Updated 6 years ago
- EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick. ☆69 · Jan 27, 2026 · Updated 2 months ago
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/… ☆10 · Jun 21, 2023 · Updated 2 years ago
- 🔥🔥🔥 Object State Description & Change Detection ☆10 · Mar 30, 2024 · Updated last year
- A unified framework to jointly model images, text, and human attention traces. ☆79 · May 24, 2021 · Updated 4 years ago
- reanalysis of the ObjectNet paper and our annotations and code ☆16 · Mar 4, 2021 · Updated 5 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform ☆17 · Jul 2, 2020 · Updated 5 years ago
- Framework for training Robotic Manipulators on the Obstacle Avoidance task through Reinforcement Learning ☆17 · Feb 20, 2023 · Updated 3 years ago
- Partially Non-Autoregressive Image Captioning ☆10 · Sep 30, 2021 · Updated 4 years ago
- ☆30 · May 7, 2021 · Updated 4 years ago
- Microsoft COCO Caption Evaluation Tool - Python 3 ☆33 · May 23, 2019 · Updated 6 years ago
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa… ☆78 · Dec 5, 2022 · Updated 3 years ago
- The code implementation of "Asymmetric Hash Code Learning" (AHCL) for remote sensing image retrieval, which was accepted by IEEE Trans. … ☆24 · Mar 22, 2022 · Updated 4 years ago
- ☆31 · Jun 29, 2022 · Updated 3 years ago
- Code and Resources for the Transformer Encoder Reasoning Network (TERN) - https://arxiv.org/abs/2004.09144 ☆58 · Dec 6, 2023 · Updated 2 years ago
- Official implementation of TagAlign ☆37 · Dec 11, 2024 · Updated last year
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks ☆12 · Feb 21, 2020 · Updated 6 years ago
- This repository focuses on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP ☆411 · Nov 14, 2022 · Updated 3 years ago
- Evaluating different engineering tricks that make RL work ☆15 · Jun 3, 2021 · Updated 4 years ago
- (ICML 2021) Implementation for S2SD - Simultaneous Similarity-based Self-Distillation for Deep Metric Learning. Paper Link: https://arxiv… ☆44 · Sep 18, 2020 · Updated 5 years ago
- ☆218 · Feb 26, 2022 · Updated 4 years ago
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021… ☆17 · Oct 21, 2022 · Updated 3 years ago
- OVAD: Open-vocabulary Attribute Detection code ☆31 · Aug 28, 2023 · Updated 2 years ago