srpkdyy / VideoLDM
Unofficial PyTorch implementation of the VideoLDM.
☆156Updated last year
Alternatives and similar repositories for VideoLDM:
Users that are interested in VideoLDM are comparing it to the libraries listed below
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆109Updated 6 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆274Updated 4 months ago
- Scalable Diffusion Models with State Space Backbone☆152Updated last year
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- ☆192Updated 2 months ago
- [CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space☆315Updated 11 months ago
- ☆104Updated last year
- ☆229Updated last year
- Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.☆332Updated 9 months ago
- CCEdit: Creative and Controllable Video Editing via Diffusion Models☆109Updated 10 months ago
- Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing"☆111Updated last year
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆284Updated last month
- Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper☆105Updated 3 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆410Updated 5 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆166Updated 6 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆290Updated 6 months ago
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers☆99Updated last year
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi…☆236Updated 11 months ago
- MoVQGAN - model for the image encoding and reconstruction☆233Updated last year
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆143Updated last week
- Code for the paper "Pix2Video: Video Editing using Image Diffusion"☆69Updated last year
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆164Updated last month
- [ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy☆245Updated 4 months ago
- Author's Implementation for E-LatentLPIPS☆145Updated 5 months ago
- This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.☆163Updated last year
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆242Updated 9 months ago
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)☆74Updated last month
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆300Updated 4 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆140Updated 2 months ago
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆314Updated 3 months ago