galatolofederico / clip-glass
Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"
☆179Updated 2 years ago
Related projects: ⓘ
- ☆198Updated 2 years ago
- ☆147Updated 11 months ago
- Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt☆136Updated 8 months ago
- ☆346Updated 2 years ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆122Updated 2 years ago
- ☆152Updated 2 years ago
- Finetune glide-text2im from openai on your own data.☆88Updated 2 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆331Updated 2 years ago
- Using CLIP and StyleGAN to generate faces from prompts.☆126Updated 3 years ago
- Here is a collection of checkpoints for DALLE-pytorch models, from where you can keep on training or start generating images.☆146Updated last year
- [ICCV 2021] Aligning Latent and Image Spaces to Connect the Unconnectable☆239Updated 3 years ago
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆181Updated last year
- This is a summary of easily available datasets for generalized DALLE-pytorch training.☆127Updated 2 years ago
- ☆235Updated last year
- v objective diffusion inference code for JAX.☆209Updated 2 years ago
- A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.☆455Updated 2 years ago
- CLOOB Conditioned Latent Diffusion training and inference code☆111Updated 2 years ago
- A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for a text-based guided image generation.☆206Updated 2 years ago
- Styled text-to-drawing synthesis method. Featured at IJCAI 2022 and the 2021 NeurIPS Workshop on Machine Learning for Creativity and Desi…☆276Updated last year
- combination of OpenAI GLIDE and Latent Diffusion☆136Updated 2 years ago
- Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic☆261Updated 2 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Updated 3 years ago
- 1.4B latent diffusion model fine tuning☆259Updated 2 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆357Updated last year
- ☆110Updated 3 years ago
- Learning to ground explanations of affect for visual art.☆302Updated 3 years ago
- ☆103Updated this week
- StyleGAN2-ada for practice☆175Updated 4 months ago
- code for CLIPDraw☆125Updated 2 years ago
- v objective diffusion inference code for PyTorch.☆711Updated last year