haofanwang/natural-language-joint-query-search

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/haofanwang/natural-language-joint-query-search)

haofanwang / natural-language-joint-query-search

Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.

☆224

Alternatives and similar repositories for natural-language-joint-query-search

Users that are interested in natural-language-joint-query-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

haltakov / natural-language-image-search
View on GitHub
Search photos on Unsplash using natural language
☆1,041Oct 13, 2022Updated 3 years ago
hila-chefer / Transformer-MM-Explainability
View on GitHub
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…
☆911Aug 24, 2023Updated 2 years ago
Zasder3 / train-CLIP
View on GitHub
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
☆720Apr 15, 2022Updated 4 years ago
HendrikStrobelt / miniClip
View on GitHub
☆48May 21, 2025Updated last year
Deferf / CLIP_Video_Representation
View on GitHub
Use CLIP to represent video for Retrieval Task
☆71Mar 1, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Weili-NLP / UNIMO
View on GitHub
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
☆69May 20, 2021Updated 5 years ago
ChenyuHeidiZhang / VL-commonsense
View on GitHub
☆14May 23, 2022Updated 4 years ago
ABaldrati / CLIP4CirDemo
View on GitHub
[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features
☆86Nov 12, 2024Updated last year
rmokady / CLIP_prefix_caption
View on GitHub
Simple image captioning model
☆1,421Jun 9, 2024Updated 2 years ago
OrLichter / lcm-lookahead
View on GitHub
☆57Apr 30, 2024Updated 2 years ago
ShivamShrirao / CLIP_Image_Search
View on GitHub
Search Images through image dataset with text prompt using Open AI's CLIP neural network.
☆36May 29, 2021Updated 5 years ago
rom1504 / clip-retrieval
View on GitHub
Easily compute clip embeddings and build a clip retrieval system with them
☆2,786Mar 28, 2026Updated 3 months ago
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
kevinzakka / clip_playground
View on GitHub
An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities
☆178Jul 27, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
YihengZhang-CV / MCL-Motion-Focused-Contrastive-Learning
View on GitHub
☆15Jan 11, 2022Updated 4 years ago
shuaiwa16 / image-enhanced-event-extraction
View on GitHub
The source code of the paper image enhanced event detection in news articles.
☆11May 27, 2022Updated 4 years ago
mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆14,007Updated this week
ArrowLuo / CLIP4Clip
View on GitHub
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
☆1,029Apr 12, 2024Updated 2 years ago
facebookresearch / SLIP
View on GitHub
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆792Feb 9, 2023Updated 3 years ago
USTC-IMCC / PaperReading
View on GitHub
Paper Reading of IMCC groups.
☆18Oct 22, 2025Updated 8 months ago
BAAI-WuDao / BriVL
View on GitHub
Bridging Vision and Language Model
☆286Mar 27, 2023Updated 3 years ago
ChenRocks / UNITER
View on GitHub
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
☆800Jun 30, 2021Updated 5 years ago
microsoft / Oscar
View on GitHub
Oscar and VinVL
☆1,054Aug 28, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Charleshhy / One-shot-Human-Parsing
View on GitHub
[AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing
☆72Aug 12, 2023Updated 2 years ago
pzzhang / VinVL
View on GitHub
project page for VinVL
☆360Jul 26, 2023Updated 2 years ago
google-research / xmcgan_image_generation
View on GitHub
☆100Jun 23, 2026Updated 3 weeks ago
neilfei / brivl-nmi
View on GitHub
☆60Jun 2, 2022Updated 4 years ago
ashkamath / mdetr
View on GitHub
☆1,050Oct 3, 2022Updated 3 years ago
dribnet / clipit_old
View on GitHub
VQGAN+CLIP with some additional tuning. For notebooks and the command line.
☆50Aug 20, 2021Updated 4 years ago
ylsung / VL_adapter
View on GitHub
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆212Dec 18, 2022Updated 3 years ago
Zasder3 / CLIP-Style-Transfer
View on GitHub
Doing style transfer with linguistic features using OpenAI's CLIP.
☆14May 4, 2021Updated 5 years ago
jbarrow / distillate
View on GitHub
PDF Extraction Toolkit (wraps and trains LayoutLM)
☆11Oct 8, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆34,043Mar 25, 2026Updated 3 months ago
utlive / FRIQUEE
View on GitHub
☆17Jan 17, 2021Updated 5 years ago
imirzadeh / MC-SGD
View on GitHub
Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495
☆13Oct 12, 2020Updated 5 years ago
Sense-GVT / DeCLIP
View on GitHub
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
☆678Sep 19, 2022Updated 3 years ago
WikiChao / ScalingConcept
View on GitHub
☆24Nov 1, 2024Updated last year
raoyongming / DenseCLIP
View on GitHub
[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
☆550Sep 15, 2023Updated 2 years ago
UKPLab / MMT-Retrieval
View on GitHub
☆131Dec 10, 2022Updated 3 years ago