srpkdyy/VideoLDM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/srpkdyy/VideoLDM)

srpkdyy / VideoLDM

Unofficial PyTorch implementation of the VideoLDM.

☆165

Alternatives and similar repositories for VideoLDM

Users that are interested in VideoLDM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nv-tlabs / VideoLDM
View on GitHub
☆25Apr 15, 2023Updated 3 years ago
TIGER-AI-Lab / ConsistI2V
View on GitHub
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024]
☆260Jul 1, 2024Updated 2 years ago
AILab-CVC / FreeNoise
View on GitHub
[ICLR 2024] Code for FreeNoise based on VideoCrafter
☆429Aug 25, 2025Updated 11 months ago
YingqingHe / LVDM
View on GitHub
LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation
☆503Nov 16, 2024Updated last year
nihaomiao / CVPR23_LFDM
View on GitHub
The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"
☆470Jun 18, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Vchitect / Latte
View on GitHub
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
☆1,946Oct 30, 2025Updated 8 months ago
ali-vilab / videocomposer
View on GitHub
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
☆958Nov 11, 2023Updated 2 years ago
ExponentialML / Text-To-Video-Finetuning
View on GitHub
Finetune ModelScope's Text To Video model using Diffusers 🧨
☆700Dec 14, 2023Updated 2 years ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15Mar 7, 2026Updated 4 months ago
Weifeng-Chen / control-a-video
View on GitHub
Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"
☆404Jul 4, 2023Updated 3 years ago
sihyun-yu / PVDM
View on GitHub
[CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space
☆322May 14, 2024Updated 2 years ago
YBYBZhang / ControlVideo
View on GitHub
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
☆863Oct 12, 2023Updated 2 years ago
RQ-Wu / LAMP
View on GitHub
[CVPR 2024] | LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation
☆283Apr 22, 2024Updated 2 years ago
G-U-N / Gen-L-Video
View on GitHub
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
☆308Oct 19, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pixeli99 / SVD_Xtend
View on GitHub
Stable Video Diffusion Training Code and Extensions.
☆733Jul 25, 2024Updated 2 years ago
arthurhero / deep_fill_2_pytorch
View on GitHub
Pytorch implementation of deep fill v2 (original by Jiayu et al.)
☆10Jun 26, 2019Updated 7 years ago
showlab / Awesome-Video-Diffusion
View on GitHub
A curated list of recent diffusion models for video generation, editing, and various other applications.
☆5,733Updated this week
ai-forever / KandinskyVideo
View on GitHub
KandinskyVideo — multilingual end-to-end text2video latent diffusion model
☆187May 28, 2024Updated 2 years ago
google-research / magvit
View on GitHub
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
☆1,002Jan 17, 2024Updated 2 years ago
tumurzakov / AnimateDiff
View on GitHub
AnimationDiff with train
☆124Feb 26, 2024Updated 2 years ago
Vchitect / LaVie
View on GitHub
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
☆952Nov 13, 2024Updated last year
kabachuha / InfiNet
View on GitHub
Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2…
☆85Apr 22, 2023Updated 3 years ago
YingqingHe / ScaleCrafter
View on GitHub
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
☆507Mar 7, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ali-vilab / VGen
View on GitHub
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
☆3,155Jan 10, 2025Updated last year
AILab-CVC / VideoCrafter
View on GitHub
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
☆5,069Jan 9, 2026Updated 6 months ago
m-bain / webvid
View on GitHub
Large-scale text-video dataset. 10 million captioned short videos.
☆685Aug 14, 2024Updated last year
thu-ml / controlvideo
View on GitHub
Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"
☆231Jun 12, 2023Updated 3 years ago
CiaraStrawberry / svd-temporal-controlnet
View on GitHub
☆469Feb 12, 2024Updated 2 years ago
zhang-zx / AVID
View on GitHub
This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.
☆177Feb 27, 2024Updated 2 years ago
wyhsirius / LEO
View on GitHub
☆43Nov 12, 2024Updated last year
VideoVerses / VideoVAEPlus
View on GitHub
[ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
☆410Jan 19, 2025Updated last year
PixArt-alpha / PixArt-sigma
View on GitHub
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
☆1,933Oct 31, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Alpha-VLLM / Lumina-T2X
View on GitHub
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,247Feb 16, 2025Updated last year
lucidrains / video-diffusion-pytorch
View on GitHub
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
☆1,384May 3, 2024Updated 2 years ago
ZhihaoHu / VideoControlNet
View on GitHub
Official Pytorch Implementation for "VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with …
☆118Jul 26, 2023Updated 3 years ago
mihirp1998 / VADER
View on GitHub
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…
☆315Mar 12, 2025Updated last year
videodreamer23 / videodreamer23.github.io
View on GitHub
☆31Nov 7, 2023Updated 2 years ago
hi-zhengcheng / vividzoo
View on GitHub
☆39Oct 19, 2024Updated last year
yanivnik / sinfusion-code
View on GitHub
☆110Jan 18, 2025Updated last year