xiaoyuan1996 / MCRNLinks
A multi-source cross-modal retrieval network
☆14Updated last year
Alternatives and similar repositories for MCRN
Users that are interested in MCRN are comparing it to the libraries listed below
Sorting:
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆17Updated 2 years ago
- The first research for semantic localization☆29Updated last year
- ☆10Updated 7 months ago
- modified datasets for remote sensing image caption☆11Updated 6 years ago
- Code for the paper "MSMatch: Semi-Supervised Multispectral Scene Classification with Few Labels"☆27Updated 5 months ago
- Codes for our CVPR 2021 paper "Deep Compositional Metric Learning"☆21Updated 4 years ago
- CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)☆34Updated 2 years ago
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval☆31Updated 2 years ago
- ☆24Updated 3 years ago
- Paper list of compositional zero-shot learning☆10Updated 3 years ago
- [IGARSS 2022] CapFormer: Pure transformer for remote sensing image caption☆20Updated 2 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- ☆16Updated 3 years ago
- Knowledge Distillation using Contrastive Language-Image Pretraining (CLIP) without a teacher model.☆18Updated 11 months ago
- Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)☆19Updated 2 years ago
- ☆19Updated 4 years ago
- [ECCV'22 Poster] Explicit Image Caption Editing☆22Updated 2 years ago
- IEEE/CVF International Conference on Computer Vision Workshop (2023)☆16Updated last year
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Updated 2 years ago
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models☆20Updated last year
- Pytorch implementation of Location-free Camouflge Generation Network☆22Updated 3 years ago
- Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification☆27Updated 3 months ago
- Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"☆68Updated last year
- [IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks☆75Updated last year
- Official implementation of "Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval", BMVC 2022.☆19Updated 2 years ago
- ☆11Updated 4 years ago
- ☆22Updated 3 years ago
- ☆22Updated 3 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated 2 years ago
- Remote sensing Image Captioning is a special case of Image Captioning which solves the difficulties in processing the remote sensing imag…☆11Updated 4 years ago