lucidrains / make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
☆1,958Updated 10 months ago
Alternatives and similar repositories for make-a-video-pytorch:
Users that are interested in make-a-video-pytorch are comparing it to the libraries listed below
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆766Updated 7 months ago
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch☆1,290Updated 10 months ago
- ☆2,970Updated 2 years ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,327Updated last year
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,255Updated 8 months ago
- Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion☆1,328Updated 2 years ago
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)☆1,926Updated last year
- ☆3,260Updated 10 months ago
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis☆1,172Updated last year
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators☆4,149Updated last year
- Karras et al. (2022) diffusion models for PyTorch☆2,411Updated 2 months ago
- ☆1,030Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,554Updated last year
- Open-Set Grounded Text-to-Image Generation☆2,096Updated last year
- Consistency Distilled Diff VAE☆2,162Updated last year
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆948Updated 2 years ago
- ☆1,560Updated 2 years ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)☆887Updated 2 years ago
- ☆1,465Updated last year
- Pretrained Dalle2 from laion☆501Updated last year
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,883Updated 2 months ago
- Official implementation of VQ-Diffusion☆920Updated 11 months ago
- Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch☆8,214Updated 5 months ago
- GLIDE: a diffusion-based text-conditional image synthesis model☆3,595Updated last year
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability☆928Updated last year
- Zero-shot Image-to-Image Translation [SIGGRAPH 2023]☆1,107Updated 5 months ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆891Updated last year
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆546Updated 2 years ago
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"☆1,140Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆4,775Updated 8 months ago