LAION-AI / phenaki
A phenaki reproduction using pytorch.
☆219Updated last year
Alternatives and similar repositories for phenaki:
Users that are interested in phenaki are comparing it to the libraries listed below
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆766Updated 7 months ago
- 1.4B latent diffusion model fine tuning☆264Updated 2 years ago
- [ECCV 2022] Compositional Generation using Diffusion Models☆469Updated 7 months ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆334Updated 2 years ago
- Finetune glide-text2im from openai on your own data.☆89Updated 2 years ago
- Official PyTorch implementation of LongVideoGAN☆316Updated 2 years ago
- Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …☆280Updated 10 months ago
- Unofficial implementation of Tune-A-Video☆191Updated 2 years ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆533Updated last year
- A linear estimator on top of clip to predict the aesthetic quality of pictures☆529Updated 2 years ago
- ☆166Updated 2 years ago
- Description and pointers of laion datasets☆245Updated 2 years ago
- Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"☆388Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆471Updated 4 months ago
- stable diffusion training☆291Updated 2 years ago
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models☆324Updated last year
- ☆334Updated 2 years ago
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆233Updated last year
- Let's make a video clip☆93Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆627Updated 7 months ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆313Updated last year
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"☆376Updated last year
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings.☆410Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆724Updated last year
- CLOOB Conditioned Latent Diffusion training and inference code☆112Updated 2 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆218Updated 9 months ago
- Video-P2P: Video Editing with Cross-attention Control☆403Updated 8 months ago
- Open reproduction of MUSE for fast text2image generation.☆347Updated 9 months ago
- The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".☆295Updated last year
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆546Updated 2 years ago