Picsart-AI-Research / StreamingT2VLinks

[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

☆1,611

Alternatives and similar repositories for StreamingT2V

Users that are interested in StreamingT2V are comparing it to the libraries listed below

Sorting:

aigc-apps / EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
☆2,231Updated 8 months ago
Doubiiu / DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
☆2,966Updated last year
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,235Updated 9 months ago
mayuelala / FollowYourClick
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via S…
☆908Updated 2 months ago
Vchitect / Latte
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
☆1,888Updated 3 weeks ago
dvlab-research / ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
☆1,621Updated last year
alibaba / animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
☆950Updated last year
MyNiuuu / MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
☆755Updated 11 months ago
Tencent / MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
☆2,470Updated this week
NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆2,007Updated 2 months ago
ali-vilab / MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
☆1,301Updated last year
open-mmlab / PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos…
☆974Updated last year
Vchitect / LaVie
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
☆939Updated last year
TMElyralab / MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
☆2,796Updated last year
PixArt-alpha / PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
☆1,867Updated last year
TencentARC / MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
☆1,466Updated 9 months ago
TencentARC / SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
☆874Updated last year
instantX-research / InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
☆1,986Updated last year
jy0205 / Pyramid-Flow
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
☆3,121Updated 11 months ago
X-LANCE / AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …
☆1,599Updated last year
aigc-apps / VideoX-Fun
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
☆1,550Updated this week
stepfun-ai / Step1X-Edit
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…
☆1,742Updated 2 months ago
TMElyralab / MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
☆2,618Updated 8 months ago
TIGER-AI-Lab / AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
☆634Updated last year
Vchitect / SEINE
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
☆944Updated last year
YangLing0818 / RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
☆1,835Updated 9 months ago
FireRedTeam / StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
☆715Updated 11 months ago
kongzhecn / OMG
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
☆697Updated last year
Jeff-LiangF / streamv2v
Official Pytorch implementation of StreamV2V.
☆520Updated 9 months ago
ali-vilab / VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
☆3,144Updated 10 months ago