Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
☆224Sep 9, 2021Updated 4 years ago
Alternatives and similar repositories for natural-language-joint-query-search
Users that are interested in natural-language-joint-query-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Search photos on Unsplash using natural language☆1,039Oct 13, 2022Updated 3 years ago
- A PyTorch Lightning solution to training OpenAI's CLIP from scratch.☆721Apr 15, 2022Updated 4 years ago
- ☆48May 21, 2025Updated last year
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- A visualizer to display attention weights on text☆24Apr 9, 2019Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆69May 20, 2021Updated 5 years ago
- This is an implementation of the paper "Show and Tell: A Neural Image Caption Generator".☆19Dec 7, 2018Updated 7 years ago
- ☆14May 23, 2022Updated 4 years ago
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆86Nov 12, 2024Updated last year
- ☆57Apr 30, 2024Updated 2 years ago
- Simple image captioning model☆1,418Jun 9, 2024Updated last year
- Easily compute clip embeddings and build a clip retrieval system with them☆2,768Mar 28, 2026Updated 2 months ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆178Jul 27, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15Jan 11, 2022Updated 4 years ago
- Paper Reading of IMCC groups.☆17Oct 22, 2025Updated 7 months ago
- An open source implementation of CLIP.☆13,889Updated this week
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆791Feb 9, 2023Updated 3 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,028Apr 12, 2024Updated 2 years ago
- A template for deep learning projects.☆16May 7, 2025Updated last year
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 3 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆44Jun 1, 2023Updated 3 years ago
- Oscar and VinVL☆1,053Aug 28, 2023Updated 2 years ago
- project page for VinVL☆359Jul 26, 2023Updated 2 years ago
- ☆99May 19, 2026Updated 2 weeks ago
- ☆1,047Oct 3, 2022Updated 3 years ago
- ☆18Jan 17, 2021Updated 5 years ago
- VQGAN+CLIP with some additional tuning. For notebooks and the command line.☆50Aug 20, 2021Updated 4 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- Doing style transfer with linguistic features using OpenAI's CLIP.☆14May 4, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆33,689Mar 25, 2026Updated 2 months ago
- PDF Extraction Toolkit (wraps and trains LayoutLM)☆10Oct 8, 2021Updated 4 years ago
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 7 months ago
- ☆24Nov 1, 2024Updated last year
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆676Sep 19, 2022Updated 3 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆549Sep 15, 2023Updated 2 years ago