kingyiusuen / clip-image-search
Search images with a text or image query, using OpenAI's pretrained CLIP model.
☆205 Updated 2 years ago
Related projects
Alternatives and complementary repositories for clip-image-search
- Using EfficientNet to provide embeddings for retrieval ☆154 Updated last year
- Easily compute CLIP embeddings from video frames ☆136 Updated last year
- Tool for automating common video key-frame extraction, video compression, and image auto-crop/image-resize tasks ☆323 Updated 3 months ago
- Crop using CLIP ☆336 Updated 2 years ago
- Extending Stable Diffusion prompts with suitable style cues using text generation ☆177 Updated last year
- ☆311 Updated last year
- Code for CLIPDraw ☆130 Updated 2 years ago
- Search photos on Unsplash based on OpenAI's CLIP model; supports search with joint image+text queries and attention visualization. ☆209 Updated 3 years ago
- Get hundreds of millions of image+URL pairs from the Crawling at Home dataset and preprocess them ☆206 Updated 5 months ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022) ☆235 Updated 2 years ago
- Let's make a video clip ☆93 Updated 2 years ago
- ☆100 Updated last year
- [A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet. ☆791 Updated last year
- Official Repository of ChatCaptioner ☆452 Updated last year
- ☆328 Updated last year
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it ☆131 Updated 9 months ago
- OpenAI CLIP text encoders for multiple languages! ☆763 Updated last year
- Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding" ☆258 Updated 5 months ago
- Fine-tuning OpenAI's CLIP model on Indian Fashion Dataset ☆50 Updated last year
- Dataset of prompts, synthetic AI-generated images, and aesthetic ratings. ☆399 Updated 2 years ago
- Our idea is to combine the power of computer vision models and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag … ☆100 Updated last year
- ☆111 Updated 3 years ago
- ☆165 Updated 2 years ago
- GIT: A Generative Image-to-text Transformer for Vision and Language ☆549 Updated 11 months ago
- This is the official repository for the LENS (Large Language Models Enhanced to See) system. ☆351 Updated 11 months ago
- Create GIFs and Videos using Stable Diffusion ☆222 Updated 8 months ago
- Sample implementation of natural language image search with OpenAI's CLIP and Elasticsearch or OpenSearch. ☆63 Updated 2 years ago
- This repository aims to implement an image search engine powered by the CLIP model. ☆39 Updated 2 years ago
- Stable Fashion: A prompt-based virtual try-on repository ☆85 Updated last year
- Official implementation of the ICASSP-2022 paper "Text2Poster: Laying Out Stylized Texts on Retrieved Images" ☆205 Updated 11 months ago