tuyunbin / Review-of-Change-CaptioningLinks
This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.
☆17Updated 3 months ago
Alternatives and similar repositories for Review-of-Change-Captioning
Users that are interested in Review-of-Change-Captioning are comparing it to the libraries listed below
Sorting:
- [TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…☆27Updated 7 months ago
- A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (RSCM…☆64Updated 9 months ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆52Updated last year
- [ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".☆20Updated 3 months ago
- Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”☆23Updated last week
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆62Updated 5 months ago
- ☆36Updated last year
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆75Updated last month
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆58Updated last month
- Official PyTorch repository for GRAM☆110Updated 7 months ago
- This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"☆35Updated last year
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆48Updated 9 months ago
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement☆28Updated 7 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆109Updated last month
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆49Updated 2 months ago
- PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)☆19Updated last year
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆19Updated 6 months ago
- [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding☆29Updated last year
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.☆49Updated 3 weeks ago
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval☆54Updated last year
- 【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification☆66Updated 9 months ago
- ☆24Updated last year
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆132Updated last month
- [IEEE TMM 2023] This is the Pytorch code for our paper "Neighborhood Contrastive Transformer for Change Captioning".☆12Updated 2 years ago
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)☆33Updated 9 months ago
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆34Updated last year
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆153Updated last year
- ☆12Updated last year
- ☆27Updated last year
- A collection of papers, datasets, benchmarks, code, and model weights for Remote Sensing Cross-Modal Image-Text Retrieval (RSCMIT).☆33Updated 4 months ago