haofanwang / natural-language-joint-query-searchView external linksLinks
Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
☆224Sep 9, 2021Updated 4 years ago
Alternatives and similar repositories for natural-language-joint-query-search
Users that are interested in natural-language-joint-query-search are comparing it to the libraries listed below
Sorting:
- Search photos on Unsplash using natural language☆1,035Oct 13, 2022Updated 3 years ago
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆900Aug 24, 2023Updated 2 years ago
- A PyTorch Lightning solution to training OpenAI's CLIP from scratch.☆719Apr 15, 2022Updated 3 years ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 4 years ago
- UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning☆70May 20, 2021Updated 4 years ago
- ☆15Jan 11, 2022Updated 4 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆723Aug 8, 2023Updated 2 years ago
- Simple image captioning model☆1,408Jun 9, 2024Updated last year
- ☆56Apr 30, 2024Updated last year
- ☆47May 21, 2025Updated 8 months ago
- A visualizer to display attention weights on text☆24Apr 9, 2019Updated 6 years ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,726Aug 15, 2025Updated 6 months ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,025Apr 12, 2024Updated last year
- An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities☆178Jul 27, 2022Updated 3 years ago
- Search Images through image dataset with text prompt using Open AI's CLIP neural network.☆36May 29, 2021Updated 4 years ago
- ☆24Nov 1, 2024Updated last year
- Bridging Vision and Language Model☆286Mar 27, 2023Updated 2 years ago
- An open source implementation of CLIP.☆13,383Updated this week
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆84Nov 12, 2024Updated last year
- project page for VinVL☆359Jul 26, 2023Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- ☆1,047Oct 3, 2022Updated 3 years ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆674Sep 19, 2022Updated 3 years ago
- [CVPR(W) 2022] UIGR: Unified Interactive Garment Retrieval☆22Dec 3, 2021Updated 4 years ago
- An Efficient Dataset Condensation Plugin and Its Application to Continual Learning. NeurIPS, 2023.☆12Nov 29, 2023Updated 2 years ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD☆10Mar 31, 2021Updated 4 years ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 3 months ago
- Oscar and VinVL☆1,052Aug 28, 2023Updated 2 years ago
- ☆96Feb 5, 2026Updated last week
- [AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing☆72Aug 12, 2023Updated 2 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆543Sep 15, 2023Updated 2 years ago
- ☆13Jul 22, 2024Updated last year
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- Code for ICPR paper☆22Nov 22, 2021Updated 4 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,562Jul 23, 2024Updated last year
- Grounded Language-Image Pre-training☆2,573Jan 24, 2024Updated 2 years ago
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆72Nov 14, 2022Updated 3 years ago