vijishmadhavan / Crop-CLIP
Crop using CLIP
☆335Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Crop-CLIP
- ☆645Updated 8 months ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆746Updated last year
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆544Updated last year
- Using CLIP and StyleGAN to generate faces from prompts.☆129Updated 3 years ago
- Styled text-to-drawing synthesis method. Featured at IJCAI 2022 and the 2021 NeurIPS Workshop on Machine Learning for Creativity and Desi…☆277Updated last year
- Omnivore: A Single Model for Many Visual Modalities☆559Updated last year
- CLIP Object Detection, search object on image using natural language #Zeroshot #Unsupervised #CLIP #ODS☆138Updated 2 years ago
- ☆265Updated 2 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆333Updated 2 years ago
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆200Updated last year
- Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and…☆602Updated 8 months ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆690Updated last year
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆364Updated last year
- Officially unofficial re-implementation of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.☆488Updated last year
- Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic☆269Updated 2 years ago
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆523Updated 11 months ago
- ☆198Updated 2 years ago
- Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)☆1,012Updated 2 years ago
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆949Updated 2 years ago
- Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP☆450Updated 2 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆205Updated 5 months ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆636Updated 2 years ago
- Modelverse: Content-Based Search for Deep Generative Models☆220Updated last year
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers☆425Updated 10 months ago
- Official implementation of the CVPR 2022 paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection".☆334Updated last year
- ☆349Updated 2 years ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,877Updated 7 months ago
- code for CLIPDraw☆130Updated 2 years ago
- Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.☆208Updated 3 years ago
- Robust fine-tuning of zero-shot models☆644Updated 2 years ago