haltakov / natural-language-image-search
Search photos on Unsplash using natural language
☆1,005Updated 2 years ago
Alternatives and similar repositories for natural-language-image-search:
Users that are interested in natural-language-image-search are comparing it to the libraries listed below
- Search images with a text or image query, using Open AI's pretrained CLIP model.☆227Updated 3 years ago
- Crop using CLIP☆338Updated 2 years ago
- OpenAI CLIP text encoders for multiple languages!☆785Updated last year
- Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP☆462Updated 2 years ago
- Simple image captioning model☆1,344Updated 8 months ago
- ☆648Updated 11 months ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,497Updated 10 months ago
- Automatically create Faiss knn indices with the most optimal similarity search parameters.☆837Updated 9 months ago
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆948Updated 2 years ago
- GIT: A Generative Image-to-text Transformer for Vision and Language☆557Updated last year
- CLIP + FFT/DWT/RGB = text to image/video☆782Updated 2 weeks ago
- Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".☆1,756Updated last year
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆761Updated 2 years ago
- Sketch Your Own GAN: Customizing a GAN model with hand-drawn sketches.☆712Updated last year
- Officially unofficial re-implementation of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.☆499Updated last year
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch☆1,231Updated 2 years ago
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)☆796Updated 3 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow.☆433Updated 3 years ago
- WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag…☆1,032Updated 5 months ago
- Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks☆346Updated 6 months ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆547Updated 2 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆5,599Updated last year
- Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).☆1,178Updated 8 months ago
- Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)☆4,054Updated last year
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆213Updated last year
- Simple implementation of OpenAI CLIP model in PyTorch.☆657Updated 10 months ago
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation☆5,052Updated 6 months ago
- Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation☆839Updated 7 months ago
- FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.☆367Updated last month
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.☆293Updated 9 months ago