xiaoyuan1996 / MCRNLinks
A multi-source cross-modal retrieval network
☆14Updated 2 years ago
Alternatives and similar repositories for MCRN
Users that are interested in MCRN are comparing it to the libraries listed below
Sorting:
- The first research for semantic localization☆29Updated 2 years ago
- CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)☆36Updated 3 years ago
- ☆25Updated 3 years ago
- modified datasets for remote sensing image caption☆11Updated 6 years ago
- Codes for our CVPR 2021 paper "Deep Compositional Metric Learning"☆21Updated 4 years ago
- The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…☆17Updated 3 years ago
- ☆15Updated 3 years ago
- Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification☆27Updated 8 months ago
- A novel deep hashing method (DHCNN) for remote sensing image retrieval and classification, which was pulished in IEEE Trans. Geosci. Remo…☆10Updated 3 years ago
- ☆10Updated last year
- ☆10Updated 2 years ago
- Pytorch implementation of Location-free Camouflge Generation Network☆23Updated 3 years ago
- Code for the paper "MSMatch: Semi-Supervised Multispectral Scene Classification with Few Labels"☆28Updated 10 months ago
- [ECCV'22 Poster] Explicit Image Caption Editing☆22Updated 3 years ago
- ☆20Updated 2 years ago
- Remote sensing Image Captioning is a special case of Image Captioning which solves the difficulties in processing the remote sensing imag…☆11Updated 4 years ago
- IEEE/CVF International Conference on Computer Vision Workshop (2023)☆17Updated last year
- ☆22Updated 4 years ago
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆10Updated 10 months ago
- [IGARSS 2022] CapFormer: Pure transformer for remote sensing image caption☆21Updated 3 years ago
- Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"☆27Updated 2 years ago
- [IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks☆80Updated 2 years ago
- Paper list of compositional zero-shot learning☆11Updated 3 years ago
- Official implementation of "Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval", BMVC 2022.☆21Updated 2 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- ☆12Updated 3 years ago
- ☆38Updated 3 years ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated 2 years ago
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions☆50Updated 2 years ago