lucidrains / parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆527Updated last year
Alternatives and similar repositories for parti-pytorch:
Users that are interested in parti-pytorch are comparing it to the libraries listed below
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆335Updated 2 years ago
- Official implementation of VQ-Diffusion☆910Updated 9 months ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆705Updated last year
- Pretrained Dalle2 from laion☆501Updated last year
- Official Jax Implementation of MaskGIT☆479Updated 2 years ago
- ☆275Updated 2 years ago
- v objective diffusion inference code for PyTorch.☆717Updated 2 years ago
- [CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models☆819Updated last year
- Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)☆421Updated last year
- ☆350Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆545Updated 2 years ago
- 1.4B latent diffusion model fine tuning☆263Updated 2 years ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆886Updated 11 months ago
- A phenaki reproduction using pytorch.☆220Updated last year
- [ECCV 2022] Compositional Generation using Diffusion Models☆459Updated 5 months ago
- ☆332Updated last year
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆181Updated last year
- Open reproduction of MUSE for fast text2image generation.☆339Updated 7 months ago
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch☆295Updated last year
- Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]☆569Updated 7 months ago
- Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …☆275Updated 8 months ago
- ☆450Updated 2 years ago
- A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.☆461Updated 2 years ago
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆947Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆718Updated last year
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆742Updated last year
- v objective diffusion inference code for JAX.☆212Updated 2 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆377Updated last year
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings.☆405Updated 2 years ago
- Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.☆273Updated 2 years ago