sushizixin / CLIP4IDCView external linksLinks
CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)
☆36Nov 12, 2022Updated 3 years ago
Alternatives and similar repositories for CLIP4IDC
Users that are interested in CLIP4IDC are comparing it to the libraries listed below
Sorting:
- ☆29Oct 19, 2022Updated 3 years ago
- Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)☆50Dec 8, 2022Updated 3 years ago
- [ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".☆13Jan 16, 2022Updated 4 years ago
- Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)☆19Oct 15, 2022Updated 3 years ago
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.☆48Nov 15, 2021Updated 4 years ago
- ☆16Dec 25, 2021Updated 4 years ago
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆17Oct 21, 2022Updated 3 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Jul 2, 2020Updated 5 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆123Dec 17, 2022Updated 3 years ago
- This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.☆18Aug 6, 2022Updated 3 years ago
- Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019☆17Sep 8, 2019Updated 6 years ago
- A unified framework to jointly model images, text, and human attention traces.☆79May 24, 2021Updated 4 years ago
- [IEEE TGRS 2022 🔥] Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Method and a Large Scale Dataset☆137Sep 16, 2025Updated 4 months ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- Scale-Steerable CNN Implementation for PyTorch☆23Nov 12, 2020Updated 5 years ago
- 🔥Collection of resources and papers☆69May 30, 2025Updated 8 months ago
- Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]☆273Jul 27, 2021Updated 4 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language".☆62Sep 30, 2020Updated 5 years ago
- ☆31Jun 29, 2022Updated 3 years ago
- ☆30May 7, 2021Updated 4 years ago
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)☆29Dec 9, 2020Updated 5 years ago
- [MedIA 2025] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation☆40Aug 10, 2025Updated 6 months ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆198May 9, 2023Updated 2 years ago
- Official implementation of TagAlign☆35Dec 11, 2024Updated last year
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆28Oct 17, 2023Updated 2 years ago
- ☆35Oct 21, 2023Updated 2 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆80Jan 7, 2026Updated last month
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 5 years ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆22Dec 10, 2025Updated 2 months ago
- Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…☆74Dec 6, 2023Updated 2 years ago
- This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP☆411Nov 14, 2022Updated 3 years ago
- Implementation of paper "Improving Image Captioning with Better Use of Caption"☆33Sep 15, 2020Updated 5 years ago
- A Universal Adversarial Dataset☆35Oct 26, 2020Updated 5 years ago
- (ICML 2021) Implementation for S2SD - Simultaneous Similarity-based Self-Distillation for Deep Metric Learning. Paper Link: https://arxiv…☆44Sep 18, 2020Updated 5 years ago
- Open Set Semantic Segmentation☆10Dec 23, 2020Updated 5 years ago
- Code and performance tests to demonstrate the COUNTLESS algorithm. https://medium.com/@willsilversmith/countless-high-performance-2x-down…☆10Oct 23, 2019Updated 6 years ago
- ☆10Apr 7, 2025Updated 10 months ago
- ☆10Oct 5, 2022Updated 3 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago