Noah888 / DAR
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective
☆9Updated 6 months ago
Alternatives and similar repositories for DAR:
Users that are interested in DAR are comparing it to the libraries listed below
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆40Updated last year
- ☆45Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆32Updated last year
- Unofficial Implementation to CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification [ICCV'23]☆24Updated 10 months ago
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆33Updated last year
- Enhance Vision-Language Alignment with Noise (AAAI 2025)☆19Updated 4 months ago
- ☆16Updated 5 months ago
- Official pytorch implementation of ZiRa, a method for incremental vision language object detection (IVLOD),which has been accepted by Neu…☆23Updated 6 months ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆45Updated last week
- Word4Per is an innovative framework for Zero-Shot Composed Person Retrieval (ZS-CPR), integrating visual and textual information for enha…☆22Updated 4 months ago
- ☆23Updated last year
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)☆24Updated last month
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆41Updated this week
- Multimodal-Composite-Editing-and-Retrieval-update☆32Updated 6 months ago
- ☆10Updated last year
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models☆71Updated 11 months ago
- ☆33Updated last year
- A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)☆16Updated last year
- Python code to implement DeIL, a CLIP based approach for open-world few-shot learning.☆14Updated 5 months ago
- Official PyTorch implementation for Hypersphere-Based Remote Sensing Cross-Modal Text–Image Retrieval via Curriculum Learning.☆14Updated 8 months ago
- Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023☆25Updated last year
- PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)☆15Updated last year
- This repository lists some awesome public projects about Zero-shot/Few-shot Learning based on CLIP (Contrastive Language-Image Pre-Traini…☆22Updated 5 months ago
- 📖 Official Code for “PIR-CLIP: Remote Sensing Image-text Retrieval with Prior Instruction Representation Learning”☆18Updated 6 months ago
- [AAAI2024] Official implementation of TGP-T☆28Updated last year
- ☆14Updated last year
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆35Updated last month
- ☆21Updated 2 years ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆35Updated 4 months ago
- A codebase for flexible and efficient Image Text Representation Alignment☆19Updated last year