lucidrains / phenaki-pytorch
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
☆750Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for phenaki-pytorch
- A phenaki reproduction using pytorch.☆219Updated last year
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch☆1,918Updated 6 months ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆866Updated 8 months ago
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch☆1,250Updated 6 months ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆544Updated last year
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)☆877Updated last year
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆995Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,537Updated 10 months ago
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆949Updated 2 years ago
- Finetune ModelScope's Text To Video model using Diffusers 🧨☆664Updated 10 months ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,317Updated last year
- Pretrained Dalle2 from laion☆500Updated last year
- [ECCV 2022] Compositional Generation using Diffusion Models☆456Updated 2 months ago
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,212Updated 3 months ago
- Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion☆1,294Updated 2 years ago
- ☆1,451Updated 9 months ago
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆523Updated 11 months ago
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer☆950Updated 9 months ago
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis☆1,152Updated last year
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆740Updated last year
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"☆781Updated last year
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,824Updated 4 months ago
- Deep Learning Examples☆808Updated 3 weeks ago
- Official implementation of VQ-Diffusion☆894Updated 6 months ago
- ☆328Updated last year
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆527Updated 7 months ago
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆478Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆698Updated 9 months ago
- v objective diffusion inference code for PyTorch.☆713Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆731Updated 11 months ago