Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
☆225Sep 9, 2021Updated 4 years ago
Alternatives and similar repositories for natural-language-joint-query-search
Users that are interested in natural-language-joint-query-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Search photos on Unsplash using natural language☆1,036Oct 13, 2022Updated 3 years ago
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆903Aug 24, 2023Updated 2 years ago
- A PyTorch Lightning solution to training OpenAI's CLIP from scratch.☆720Apr 15, 2022Updated 3 years ago
- ☆47May 21, 2025Updated 10 months ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆70May 20, 2021Updated 4 years ago
- This is an implementation of the paper "Show and Tell: A Neural Image Caption Generator".☆19Dec 7, 2018Updated 7 years ago
- ☆15May 23, 2022Updated 3 years ago
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆85Nov 12, 2024Updated last year
- ☆57Apr 30, 2024Updated last year
- Easily compute clip embeddings and build a clip retrieval system with them☆2,736Updated this week
- 📝 Anything for coding faster and more comfortable.☆13Jan 21, 2026Updated 2 months ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆178Jul 27, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆15Jan 11, 2022Updated 4 years ago
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- Paper Reading of IMCC groups.☆17Oct 22, 2025Updated 5 months ago
- An open source implementation of CLIP.☆13,579Mar 12, 2026Updated 2 weeks ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,026Apr 12, 2024Updated last year
- A template for deep learning projects.☆16May 7, 2025Updated 10 months ago
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 3 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆44Jun 1, 2023Updated 2 years ago
- [AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing☆72Aug 12, 2023Updated 2 years ago
- Oscar and VinVL☆1,052Aug 28, 2023Updated 2 years ago
- project page for VinVL☆359Jul 26, 2023Updated 2 years ago
- ☆97Updated this week
- ☆1,047Oct 3, 2022Updated 3 years ago
- ☆60Jun 2, 2022Updated 3 years ago
- VQGAN+CLIP with some additional tuning. For notebooks and the command line.☆50Aug 20, 2021Updated 4 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆211Dec 18, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,946Feb 18, 2026Updated last month
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 4 months ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆677Sep 19, 2022Updated 3 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆544Sep 15, 2023Updated 2 years ago
- ☆12Mar 23, 2018Updated 8 years ago
- ☆131Dec 10, 2022Updated 3 years ago
- ☆152Sep 28, 2022Updated 3 years ago