abhrac / xmodal-vitLinks
Official implementation of "Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval", BMVC 2022.
☆19Updated last year
Alternatives and similar repositories for xmodal-vit
Users that are interested in xmodal-vit are comparing it to the libraries listed below
Sorting:
- Cross-modal Hierarchical Modelling for FGSBIR. Work accepted for Oral presentation in BMVC 2020☆18Updated last year
- Official implementation of Data-Free Sketch-Based Image Retrieval, CVPR 2023.☆26Updated last year
- [TPAMI 2023] Generative Multi-Label Zero-Shot Learning☆52Updated last year
- Code release for Your “Flamingo” is My “Bird”: Fine-Grained, or Not (CVPR 2021 Oral)☆60Updated last year
- [AAAI 2023] The official implementation of "A Benchmark and Asymmetrical-Similarity Learning for Practical Image Copy Detection"☆22Updated 4 months ago
- Shared Attention for Multi-label Zero-shot Learning accepted @ CVPR20☆32Updated 3 years ago
- [BMVC 2023 (Oral)] Official pytorch implementation of the paper: "Unsupervised Hashing with Similarity Distribution Calibration"☆21Updated last year
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆38Updated 11 months ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 2 years ago
- [NeurIPS'23] Parts of Speech–Grounded Subspaces in Vision-Language Models☆28Updated last year
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 2 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 3 years ago
- Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper is accepted by CVPR2020.☆22Updated 2 years ago
- ☆47Updated 2 years ago
- Generating Image Specific Text☆27Updated last year
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 3 years ago
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).☆116Updated 3 years ago
- PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆39Updated 2 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25Updated last year
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and …☆62Updated 3 years ago
- [CVPR(W) 2022] UIGR: Unified Interactive Garment Retrieval☆21Updated 3 years ago
- Hypergraph-Induced Semantic Tuplet Loss for Deep Metric Learning [CVPR'22]☆23Updated 3 years ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆31Updated 2 years ago
- ☆34Updated 2 years ago
- Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification☆27Updated 3 weeks ago
- Code of the paper "Solving Inefficiency of Self-supervised Representation Learning"☆38Updated 3 years ago
- Official Implementation of AlignMixup - CVPR 2022☆71Updated 3 years ago
- Fine-Grained Generalized Zero-Shot Learning via Dense Attribute-Based Attention accepted @ CVPR20☆51Updated 2 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆89Updated 2 years ago
- vit for few-shot classification☆47Updated 2 years ago