sjy0727 / CLIP-Text-Image-RetrievalLinks

该项目旨在通过输入文本描述来检索与之相匹配的图片。

☆42

Alternatives and similar repositories for CLIP-Text-Image-Retrieval

Users that are interested in CLIP-Text-Image-Retrieval are comparing it to the libraries listed below

Sorting:

ppanzx / CHAN
☆50Updated 2 years ago
zhangy0822 / USER
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆33Updated 3 months ago
HuangYuantong / video-text-retrieval
毕业设计：《基于CLIP模型的视频文本检索设计与实现》
☆12Updated last year
LCFractal / TGDT
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training
☆28Updated 2 years ago
yangjianxin1 / ClipCap-Chinese
基于ClipCap的看图说话Image Caption模型
☆315Updated 3 years ago
ChiYeungLaw / LexLIP-ICCV23
Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…
☆40Updated last year
zjukg / Structure-CLIP
[Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations
☆148Updated last year
alipay / Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
☆167Updated 3 months ago
weiji-Feng / Image2Poem
A project that can generate ancient poems based on pictures, including CLIP, T5, GPT2 models
☆22Updated 7 months ago
AAA-Zheng / Image-Text-Matching-Summary
Summary of Related Research on Image-Text Matching
☆71Updated 2 years ago
LuminosityX / HAT
Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'
☆27Updated last year
haokunwen / DQU-CIR
[SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
☆43Updated last year
sunanhe / MKT
Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".
☆129Updated 11 months ago
lerogo / aaai24_itr_cusa
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
☆48Updated last year
DarrenZZhang / MM23-MITH
☆19Updated last year
sugarandgugu / Text2Image-Retrieval
计算机视觉课程设计-基于Chinese-CLIP的图文检索系统
☆94Updated 2 years ago
linyq2117 / TagCLIP
[AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training
☆102Updated last year
kkzhang95 / Awesome-Composed-Multi-modal-Retrieval
A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…
☆57Updated last month
om-ai-lab / GroundVLP
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
☆72Updated last year
BUAADreamer / SPN4CIR
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
☆39Updated last month
PKU-ICST-MIPL / MKVSE-TOMM2023
☆28Updated 2 years ago
KevinLight831 / AMC
[ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval
☆20Updated last year
JiuTian-VL / JiuTian-LION
[CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
☆153Updated last month
Paranioar / RCAR
[TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
☆33Updated last year
CrossmodalGroup / NAAF
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆119Updated 2 years ago
PKU-ICST-MIPL / Finedefics_ICLR2025
☆72Updated 5 months ago
adxcreative / EERCF
Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning
☆19Updated 7 months ago
fuxianghuang1 / Multimodal-Composite-Editing-and-Retrieval
Multimodal-Composite-Editing-and-Retrieval-update
☆33Updated 11 months ago
OpenMatch / UniVL-DR
[ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…
☆53Updated last year
Cuberick-Orion / Bi-Blip4CIR
The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…
☆32Updated last year