Noah888 / DARLinks

Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective

☆10

Alternatives and similar repositories for DAR

Users that are interested in DAR are comparing it to the libraries listed below

Sorting:

Paranioar / RCAR
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
☆33Updated last year
lerogo / aaai24_itr_cusa
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
☆42Updated last year
gaojingsheng / LAMM
Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024
☆32Updated last year
Zjut-MultimediaPlus / PIR-pytorch
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆16Updated last year
jiexuanyan / CPRFL
☆18Updated 7 months ago
cs-mshah / CDUL
Unofficial Implementation to CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification [ICCV'23]
☆29Updated last year
LiShuo1001 / LDC
The code of "Logits DeConfusion with CLIP for Few-Shot Learning" (CVPR 2025)
☆24Updated 2 weeks ago
hhc1997 / L2RM
☆34Updated last year
zhangy0822 / USER
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆32Updated last week
ppanzx / CHAN
☆48Updated last year
linhuixiao / HiVG
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆51Updated 2 months ago
XLearning-SCU / 2024-TIP-CREAM
PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)
☆17Updated last year
WayneTomas / TransCP
[TPAMI 2024] This is the Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding".
☆17Updated last month
hyzhang98 / PiNI
Enhance Vision-Language Alignment with Noise (AAAI 2025)
☆24Updated 6 months ago
sMamooler / CLIP_Explainability
code for studying OpenAI's CLIP explainability
☆32Updated 3 years ago
yic20 / CoMC
[ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition
☆14Updated 11 months ago
hhc1997 / MSCN
☆10Updated last year
xu5zhao / BiCro
☆27Updated 2 years ago
XLearning-SCU / Awesome-Noisy-Correspondence
This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…
☆66Updated 2 weeks ago
jaychempan / PIR-CLIP
📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”
☆18Updated 8 months ago
ZhangWeihang99 / HVSA
Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.
☆15Updated 10 months ago
Delong-liu-bupt / Composed_Person_Retrieval
Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale person image databas…
☆25Updated last month
dogehhh / ReCLIP
Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
☆45Updated 2 weeks ago
hulianyuyy / Deep_Correlated_Prompting
Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)
☆23Updated 3 months ago
yyh-rain-song / ReMamber
ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.
☆39Updated 11 months ago
shuanglinyan / CFine
CLIP-Driven Fine-grained Text-Image Person Re-identification
☆49Updated last year
TangXu-Group / Cross-modal-remote-sensing-image-and-text-retrieval-models
☆20Updated 9 months ago
ChenDelong1999 / ITRA
A codebase for flexible and efficient Image Text Representation Alignment
☆19Updated 2 years ago
sunxm2357 / DualCoOp
Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))
☆63Updated last year
The-Shuai / DeIL
Python code to implement DeIL, a CLIP based approach for open-world few-shot learning.
☆15Updated 7 months ago