tuyunbin / NCT
[IEEE TMM 2023] This is the Pytorch code for our paper "Neighborhood Contrastive Transformer for Change Captioning".
☆12Updated last year
Alternatives and similar repositories for NCT:
Users that are interested in NCT are comparing it to the libraries listed below
- ☆19Updated 4 months ago
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆29Updated 9 months ago
- A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)☆14Updated last year
- This is the official code for "Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning"☆25Updated 3 weeks ago
- Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023☆22Updated last year
- 📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”☆15Updated 3 months ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆30Updated 9 months ago
- 🎮 A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (R…☆48Updated 9 months ago
- ☆16Updated 2 years ago
- Official pytorch implementation of paper "Remote Sensing Image Captioning Based on Multi-Layer Aggregated Transformer"☆27Updated last year
- ☆28Updated 2 years ago
- A codebase for flexible and efficient Image Text Representation Alignment☆18Updated last year
- Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"☆66Updated last year
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆23Updated last month
- [CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .☆20Updated 2 years ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆32Updated 2 months ago
- Related papers about Referring Image Segmentation (RIS)☆16Updated last year
- Towards Local Visual Modeling for Image Captioning☆27Updated last year
- ☆43Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆34Updated 3 months ago
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆30Updated 6 months ago
- The code of the paper "Cross-Modal Graph Matching Network for Image-Text Retrieval" in ACM Transactions on Multimedia Computing, Communic…☆44Updated last year
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆22Updated last year
- ☆34Updated last year
- Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification"…☆19Updated last year
- [ICME'22] Visual Grounding with Transformers☆28Updated 2 years ago
- Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)☆48Updated 2 years ago
- [Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning☆49Updated 11 months ago
- Data set for the IEEE TGRS paper "Mutual Attention Inception Network for Remote Sensing Visual Question Answering"☆17Updated 2 years ago
- [TPAMI 2024] This is the Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding".☆16Updated 3 months ago