lucidrains / parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆523Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for parti-pytorch
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆334Updated 2 years ago
- Official Jax Implementation of MaskGIT☆449Updated 2 years ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆693Updated last year
- Official implementation of VQ-Diffusion☆899Updated 7 months ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆545Updated last year
- v objective diffusion inference code for PyTorch.☆714Updated last year
- Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)☆412Updated last year
- ☆350Updated 2 years ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆869Updated 8 months ago
- Pretrained Dalle2 from laion☆500Updated last year
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆741Updated last year
- 1.4B latent diffusion model fine tuning☆261Updated 2 years ago
- A phenaki reproduction using pytorch.☆219Updated last year
- ☆443Updated 2 years ago
- Open reproduction of MUSE for fast text2image generation.☆332Updated 5 months ago
- ☆275Updated 2 years ago
- [ECCV 2022] Compositional Generation using Diffusion Models☆455Updated 3 months ago
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆951Updated 2 years ago
- MinImagen: A minimal implementation of the Imagen text-to-image model☆295Updated last year
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆180Updated last year
- [CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models☆806Updated last year
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆368Updated last year
- PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations☆1,000Updated last year
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆194Updated 9 months ago
- v objective diffusion inference code for JAX.☆211Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆747Updated last year
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch☆286Updated last year
- Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]☆560Updated 5 months ago
- [SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets☆967Updated 4 months ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆699Updated 9 months ago