lucidrains / lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
☆269Updated 8 months ago
Alternatives and similar repositories for lumiere-pytorch:
Users who are interested in lumiere-pytorch are comparing it to the libraries listed below
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆491Updated 8 months ago
- ☆191Updated last month
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆492Updated 9 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆331Updated 2 months ago
- Official Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆359Updated 10 months ago
- Unofficial PyTorch implementation of the VideoLDM.☆154Updated last year
- Train VAE like a boss☆270Updated 5 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆278Updated 4 months ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆488Updated 4 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆250Updated 2 weeks ago
- FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral)☆223Updated 3 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆408Updated 4 months ago
- Scaling Diffusion Transformers with Mixture of Experts☆294Updated 6 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆117Updated 11 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆297Updated 3 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆441Updated 5 months ago
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆491Updated last year
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆399Updated last year
- [CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts☆292Updated 9 months ago
- Code for instruction-tuning Stable Diffusion.☆223Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆413Updated last year
- Data release for the ImageInWords (IIW) paper.☆209Updated 4 months ago
- Official implementation for "Stable Flow: Vital Layers for Training-Free Image Editing" [CVPR 2025]☆323Updated 2 months ago
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆301Updated 2 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆268Updated 3 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆256Updated last month
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆241Updated 8 months ago
- Scalable Diffusion Models with State Space Backbone☆152Updated last year
- Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen☆378Updated 2 weeks ago
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆143Updated 2 weeks ago