ZiChao111 / FTI4CIR
Code for the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval
☆14 Updated last month
Related projects:
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval ☆19 Updated 2 months ago
- ☆43 Updated 3 weeks ago
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning ☆11 Updated 2 months ago
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024. ☆15 Updated 4 months ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral] ☆36 Updated 5 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em… ☆35 Updated last week
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024 ☆19 Updated 5 months ago
- CLIP-Driven Fine-grained Text-Image Person Re-identification ☆34 Updated 9 months ago
- Collection of Composed Image Retrieval (CIR) papers. ☆67 Updated last week
- AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval ☆13 Updated 3 weeks ago
- Cross-modal Active Complementary Learning with Self-refining Correspondence (NeurIPS 2023, Pytorch Code) ☆12 Updated 3 months ago
- Noisy-Correspondence Learning for Text-to-Image Person Re-identification (CVPR 2024 Pytorch Code) ☆51 Updated 3 months ago
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models" ☆38 Updated 2 weeks ago
- ☆13 Updated 2 weeks ago
- ☆13 Updated last month
- Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)" ☆44 Updated last month
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models ☆11 Updated 2 months ago
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval ☆60 Updated 5 months ago
- Word4Per is an innovative framework for Zero-Shot Composed Person Retrieval (ZS-CPR), integrating visual and textual information for enha… ☆13 Updated 9 months ago
- ☆11 Updated 2 months ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text … ☆11 Updated 2 weeks ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024) ☆62 Updated 7 months ago
- 【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search ☆45 Updated 5 months ago
- Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval ☆12 Updated last month
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval" ☆21 Updated 5 months ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization ☆13 Updated last year
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro… ☆23 Updated 7 months ago
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval ☆42 Updated last year
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding ☆12 Updated 3 weeks ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning ☆31 Updated last month