vijishmadhavan / Crop-CLIP
Crop using CLIP
☆339Updated 2 years ago
Alternatives and similar repositories for Crop-CLIP:
Users that are interested in Crop-CLIP are comparing it to the libraries listed below
- CLIP Object Detection, search object on image using natural language #Zeroshot #Unsupervised #CLIP #ODS☆139Updated 3 years ago
- Using CLIP and StyleGAN to generate faces from prompts.☆131Updated 3 years ago
- OpenAI CLIP text encoders for multiple languages!☆791Updated last year
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆530Updated last year
- ☆198Updated 3 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆388Updated 2 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆334Updated 2 years ago
- ☆655Updated last year
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆241Updated 2 years ago
- Officially unofficial re-implementation of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.☆505Updated 2 years ago
- Styled text-to-drawing synthesis method. Featured at IJCAI 2022 and the 2021 NeurIPS Workshop on Machine Learning for Creativity and Desi…☆279Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆766Updated 2 years ago
- ☆351Updated 2 years ago
- Modelverse: Content-Based Search for Deep Generative Models☆223Updated 4 months ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆546Updated 2 years ago
- This is a summary of easily available datasets for generalized DALLE-pytorch training.☆128Updated 2 years ago
- GIT: A Generative Image-to-text Transformer for Vision and Language☆563Updated last year
- code for CLIPDraw☆136Updated 2 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆218Updated 10 months ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,937Updated last year
- Official PaddlePaddle implementation of Paint Transformer☆317Updated 3 years ago
- Here is a collection of checkpoints for DALLE-pytorch models, from where you can keep on training or start generating images.☆147Updated 2 years ago
- [WACV2021] Foreground-aware Semantic Representations for Image Harmonization https://arxiv.org/abs/2006.00809☆271Updated last year
- A phenaki reproduction using pytorch.☆220Updated last year
- Robust fine-tuning of zero-shot models☆693Updated 2 years ago
- Towards Unified Keyframe Propagation Models☆238Updated 2 years ago
- Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation☆843Updated 8 months ago
- Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.☆221Updated 3 years ago
- ☆334Updated 2 years ago
- Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and…☆614Updated last year