lucidrains / flexible-diffusion-modeling-videos-pytorch
Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Pytorch
☆84Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for flexible-diffusion-modeling-videos-pytorch
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆123Updated 2 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 2 years ago
- JAX implementation ViT-VQGAN☆77Updated 2 years ago
- The official implementation of "Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN"☆78Updated 2 years ago
- Finetune glide-text2im from openai on your own data.☆88Updated 2 years ago
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models☆160Updated last year
- Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks (ICLR 2022).☆182Updated last year
- ☆110Updated last year
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …☆268Updated 6 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆78Updated last year
- ☆48Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆126Updated last year
- The PyTorch implementation of Latent Video Transformer.☆95Updated 6 months ago
- Implementation of Transframer, Deepmind's U-net + Transformer architecture for up to 30 seconds video generation, in Pytorch☆68Updated 2 years ago
- GENIE: Higher-Order Denoising Diffusion Solvers☆89Updated last year
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆180Updated last year
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).☆155Updated last year
- This respository contains the code for the CVPR 2023 paper SINE: SINgle Image Editing with Text-to-Image Diffusion Models.☆183Updated 10 months ago
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)☆137Updated 11 months ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆70Updated 2 years ago
- Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models☆192Updated last year
- ☆78Updated 10 months ago
- [NeurIPS 2022] (Amortized) distributional control for pre-trained generative models☆119Updated last year
- ☆60Updated last year
- [arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆84Updated 5 months ago
- Simple script to compute CLIP-based scores given a DALL-e trained model.☆30Updated 3 years ago
- ☆63Updated last year
- (wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.☆28Updated 2 years ago