This project aims to retrieve images that match an input text description.
☆44Aug 24, 2023Updated 2 years ago
Alternatives and similar repositories for CLIP-Text-Image-Retrieval
Users that are interested in CLIP-Text-Image-Retrieval are comparing it to the libraries listed below.
- Internet image-text matching based on multimodal retrieval☆15Mar 17, 2024Updated 2 years ago
- Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training☆30Jun 20, 2023Updated 2 years ago
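- Deploys Chinese CLIP with OpenCV + onnxruntime for text-to-image search: given a sentence describing the desired image, it retrieves matching images from a gallery. Includes both C++ and Python versions☆87Jan 15, 2024Updated 2 years ago
- A simple CLIP model implemented on top of open-source pretrained models☆32Jan 14, 2023Updated 3 years ago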
- ☆27Sep 3, 2024Updated last year
- ☆13Sep 5, 2023Updated 2 years ago
- The codes of our paper "EasyInv: Toward Fast and Better DDIM Inversion"☆14Jun 1, 2025Updated 10 months ago
- An implementation of an image search engine using CLIP (Contrastive Language-Image Pre-Training)☆15Aug 9, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)☆16Jan 7, 2025Updated last year
- Controlled Text Generation Image Dataset☆26Apr 8, 2024Updated 2 years ago
- [AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”☆219Apr 11, 2024Updated 2 years ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- ☆11Dec 31, 2020Updated 5 years ago
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆276Mar 26, 2025Updated last year
- Assorted demos and coursework from personal practice☆12Jun 16, 2023Updated 2 years ago
- ESPER☆24Mar 29, 2024Updated 2 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆12Apr 28, 2024Updated last year
- A project for telling stories according to images in some particular style☆16Dec 16, 2018Updated 7 years ago
- ☆12May 3, 2024Updated last year
- Multicultural Proverbs and Sayings☆13Jan 11, 2025Updated last year
- Simple image search engine by a text query using CLIP☆23Nov 11, 2025Updated 5 months ago
- NLP-based public opinion monitoring system with support for Chinese sentiment analysis☆20Mar 10, 2025Updated last year
- ☆40Mar 3, 2026Updated last month
- resources for text detection, text recognition, and end to end text spotting☆11Apr 23, 2023Updated 2 years ago
- 💻 NUAA 2018 operating systems assignment: a memory allocation simulator (BF algorithm)☆13Jul 2, 2018Updated 7 years ago
- A program for simple mouse control via hand gestures using MediaPipe.☆12Mar 17, 2021Updated 5 years ago
- This is a PyTorch implementation of contrastive learning (CL) baselines.☆14Aug 29, 2022Updated 3 years ago
- ☆54Sep 13, 2023Updated 2 years ago
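- A basic development framework for large AI models, aimed at ordinary backend programmers, with features similar to coze, including: fastapi backend endpoints, search, document parsing and vectorization, RPA and web crawling, custom agents, third-party data API integration, mongodb database, controlled JSON output, multimodal understanding and generation, and more☆13Jul 18, 2024Updated last year
- [ECAI 2024] First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Jun 16, 2025Updated 9 months ago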
- ☆23Mar 28, 2025Updated last year
- Learn how to create impactful AI Agents using Agno AI Python Package☆13Jul 31, 2025Updated 8 months ago
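- Required operating systems lab: simulation of a multi-process concurrent environment and a simulated implementation of low-level scheduling algorithms☆10Nov 13, 2019Updated 6 years ago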
- an implementation of MoCo and MoCo-v2 improvements pre-trained on Imagenette☆23Jun 15, 2021Updated 4 years ago
- Official implementation of "Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs"☆20May 23, 2025Updated 10 months ago
- Official Code for “PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval”☆27Dec 19, 2025Updated 3 months ago
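- [Jinri Toutiao] Text author identification competition☆10Aug 20, 2018Updated 7 years ago
- This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.☆28Feb 26, 2024Updated 2 years ago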