Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
☆223Sep 9, 2021Updated 4 years ago
Alternatives and similar repositories for natural-language-joint-query-search
Users that are interested in natural-language-joint-query-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Search photos on Unsplash using natural language☆1,037Oct 13, 2022Updated 3 years ago
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆907Aug 24, 2023Updated 2 years ago
- A PyTorch Lightning solution to training OpenAI's CLIP from scratch.☆721Apr 15, 2022Updated 4 years ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆70May 20, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is an implementation of the paper "Show and Tell: A Neural Image Caption Generator".☆19Dec 7, 2018Updated 7 years ago
- ☆14May 23, 2022Updated 3 years ago
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆86Nov 12, 2024Updated last year
- ☆57Apr 30, 2024Updated 2 years ago
- Simple image captioning model☆1,418Jun 9, 2024Updated last year
- Easily compute clip embeddings and build a clip retrieval system with them☆2,760Mar 28, 2026Updated last month
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆178Jul 27, 2022Updated 3 years ago
- Paper Reading of IMCC groups.☆17Oct 22, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An open source implementation of CLIP.☆13,770Apr 30, 2026Updated last week
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆790Feb 9, 2023Updated 3 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,027Apr 12, 2024Updated 2 years ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 3 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- ☆44Jun 1, 2023Updated 2 years ago
- [AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing☆72Aug 12, 2023Updated 2 years ago
- Oscar and VinVL☆1,053Aug 28, 2023Updated 2 years ago
- project page for VinVL☆359Jul 26, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆98Updated this week
- ☆60Jun 2, 2022Updated 3 years ago
- ☆1,046Oct 3, 2022Updated 3 years ago
- ☆18Jan 17, 2021Updated 5 years ago
- VQGAN+CLIP with some additional tuning. For notebooks and the command line.☆50Aug 20, 2021Updated 4 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆33,398Mar 25, 2026Updated last month
- Doing style transfer with linguistic features using OpenAI's CLIP.☆14May 4, 2021Updated 5 years ago
- PDF Extraction Toolkit (wraps and trains LayoutLM)☆10Oct 8, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆675Sep 19, 2022Updated 3 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆548Sep 15, 2023Updated 2 years ago
- ☆131Dec 10, 2022Updated 3 years ago
- [CVPR(W) 2022] UIGR: Unified Interactive Garment Retrieval☆23Dec 3, 2021Updated 4 years ago
- ☆152Sep 28, 2022Updated 3 years ago
- StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation☆325Nov 27, 2022Updated 3 years ago
- Grounded Language-Image Pre-training☆2,588Jan 24, 2024Updated 2 years ago