lucidrains / phenaki-pytorch
Implementation of Phenaki Video, which uses MaskGIT to produce text-guided videos of up to 2 minutes in length, in Pytorch
☆790 · Updated last year
Alternatives and similar repositories for phenaki-pytorch
Users who are interested in phenaki-pytorch are comparing it to the libraries listed below.
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch ☆1,983 · Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch ☆919 · Updated last year
- A phenaki reproduction using pytorch. ☆220 · Updated 2 years ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral) ☆892 · Updated 2 years ago
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis ☆1,197 · Updated 2 years ago
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch ☆1,359 · Updated last year
- Finetune ModelScope's Text To Video model using Diffusers 🧨 ☆692 · Updated 2 years ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" … ☆1,053 · Updated 2 years ago
- Pretrained Dalle2 from laion ☆503 · Updated 2 years ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions" ☆1,562 · Updated last year
- Deep Learning Examples ☆828 · Updated last year
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch ☆537 · Updated 2 years ago
- [ECCV 2022] Compositional Generation using Diffusion Models ☆484 · Updated 7 months ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch ☆549 · Updated 2 years ago
- Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion ☆1,346 · Updated 3 years ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023 ☆1,334 · Updated 2 years ago
- Official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers" ☆956 · Updated 3 years ago
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer ☆988 · Updated last year
- Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023) ☆993 · Updated 2 years ago
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation ☆499 · Updated last year
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion ☆1,343 · Updated last year
- Implementation of Paint-with-words with Stable Diffusion: method from eDiff-I that lets you generate images from text-labeled segmentation… ☆646 · Updated 2 years ago
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information ☆617 · Updated last year
- ☆1,479 · Updated last year
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation" ☆851 · Updated 2 years ago
- Unofficial implementation of [StyleDrop](https://arxiv.org/abs/2306.00983) ☆583 · Updated 2 years ago
- ☆1,062 · Updated last year
- Large-scale text-video dataset. 10 million captioned short videos. ☆668 · Updated last year
- ☆336 · Updated 2 years ago
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings. ☆424 · Updated 3 years ago