Noah888 / DARLinks
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective
☆10Updated 8 months ago
Alternatives and similar repositories for DAR
Users that are interested in DAR are comparing it to the libraries listed below
Sorting:
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆33Updated last year
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆42Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆32Updated last year
- A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)☆16Updated last year
- ☆18Updated 7 months ago
- Unofficial Implementation to CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification [ICCV'23]☆29Updated last year
- The code of "Logits DeConfusion with CLIP for Few-Shot Learning" (CVPR 2025)☆24Updated 2 weeks ago
- ☆34Updated last year
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆32Updated last week
- ☆48Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆51Updated 2 months ago
- PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)☆17Updated last year
- [TPAMI 2024] This is the Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding".☆17Updated last month
- Enhance Vision-Language Alignment with Noise (AAAI 2025)☆24Updated 6 months ago
- code for studying OpenAI's CLIP explainability☆32Updated 3 years ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆14Updated 11 months ago
- ☆10Updated last year
- ☆27Updated 2 years ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆66Updated 2 weeks ago
- 📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”☆18Updated 8 months ago
- Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.☆15Updated 10 months ago
- Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databas…☆25Updated last month
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆45Updated 2 weeks ago
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)☆23Updated 3 months ago
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆39Updated 11 months ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆49Updated last year
- ☆20Updated 9 months ago
- A codebase for flexible and efficient Image Text Representation Alignment☆19Updated 2 years ago
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))☆63Updated last year
- Python code to implement DeIL, a CLIP based approach for open-world few-shot learning.