Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
☆223Sep 9, 2021Updated 4 years ago
Alternatives and similar repositories for natural-language-joint-query-search
Users that are interested in natural-language-joint-query-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Search photos on Unsplash using natural language☆1,036Oct 13, 2022Updated 3 years ago
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆906Aug 24, 2023Updated 2 years ago
- A PyTorch Lightning solution to training OpenAI's CLIP from scratch.☆720Apr 15, 2022Updated 4 years ago
- ☆47May 21, 2025Updated 10 months ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆70May 20, 2021Updated 4 years ago
- This is an implementation of the paper "Show and Tell: A Neural Image Caption Generator".☆19Dec 7, 2018Updated 7 years ago
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆85Nov 12, 2024Updated last year
- ☆57Apr 30, 2024Updated last year
- Simple image captioning model☆1,416Jun 9, 2024Updated last year
- Easily compute clip embeddings and build a clip retrieval system with them☆2,749Mar 28, 2026Updated 3 weeks ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆178Jul 27, 2022Updated 3 years ago
- ☆15Jan 11, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The source code of the paper image enhanced event detection in news articles.☆11May 27, 2022Updated 3 years ago
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- Paper Reading of IMCC groups.☆17Oct 22, 2025Updated 5 months ago
- An open source implementation of CLIP.☆13,695Apr 6, 2026Updated last week
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆790Feb 9, 2023Updated 3 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,026Apr 12, 2024Updated 2 years ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 3 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆44Jun 1, 2023Updated 2 years ago
- Oscar and VinVL☆1,051Aug 28, 2023Updated 2 years ago
- project page for VinVL☆359Jul 26, 2023Updated 2 years ago
- ☆98Mar 25, 2026Updated 3 weeks ago
- ☆1,047Oct 3, 2022Updated 3 years ago
- ☆60Jun 2, 2022Updated 3 years ago
- VQGAN+CLIP with some additional tuning. For notebooks and the command line.☆50Aug 20, 2021Updated 4 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆33,184Mar 25, 2026Updated 3 weeks ago
- Doing style transfer with linguistic features using OpenAI's CLIP.☆14May 4, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 5 months ago
- ☆24Nov 1, 2024Updated last year
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆676Sep 19, 2022Updated 3 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆546Sep 15, 2023Updated 2 years ago
- ☆12Mar 23, 2018Updated 8 years ago
- ☆131Dec 10, 2022Updated 3 years ago
- ☆152Sep 28, 2022Updated 3 years ago