lucidrains / phenaki-pytorch
Implementation of Phenaki Video, which uses MaskGIT to produce text-guided videos of up to 2 minutes in length, in Pytorch
☆769 Updated 8 months ago
Alternatives and similar repositories for phenaki-pytorch:
Users who are interested in phenaki-pytorch are comparing it to the libraries listed below:
- A phenaki reproduction using pytorch. ☆220 Updated last year
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch ☆546 Updated 2 years ago
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch ☆1,300 Updated 11 months ago
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch ☆1,963 Updated 11 months ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral) ☆888 Updated 2 years ago
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis ☆1,178 Updated 2 years ago
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer ☆981 Updated last year
- [ECCV 2022] Compositional Generation using Diffusion Models ☆471 Updated 7 months ago
- Finetune ModelScope's Text To Video model using Diffusers 🧨 ☆683 Updated last year
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing" ☆1,145 Updated last year
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability ☆930 Updated last year
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch ☆530 Updated last year
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" … ☆1,027 Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation ☆476 Updated 5 months ago
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation" ☆823 Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch ☆892 Updated last year
- Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion" ☆998 Updated last year
- Pretrained Dalle2 from laion ☆501 Updated 2 years ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions" ☆1,558 Updated last year
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information ☆589 Updated 8 months ago
- Large-scale text-video dataset. 10 million captioned short videos. ☆628 Updated 8 months ago
- Implementation of Paint-with-words with Stable Diffusion: a method from eDiff-I that lets you generate images from text-labeled segmentation… ☆642 Updated 2 years ago
- Unofficial implementation of [StyleDrop](https://arxiv.org/abs/2306.00983) ☆577 Updated last year
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral) ☆534 Updated last year
- Consistency Distilled Diff VAE ☆2,175 Updated last year
- Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV … ☆282 Updated 11 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024 ☆747 Updated last year
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction ☆936 Updated 5 months ago
- The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models" ☆460 Updated 9 months ago
- ☆334 Updated 2 years ago