Picsart-AI-Research / StreamingT2VLinks
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
β1,627Updated 10 months ago
Alternatives and similar repositories for StreamingT2V
Users that are interested in StreamingT2V are comparing it to the libraries listed below
Sorting:
- πΊ An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusionβ2,249Updated 11 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priorsβ2,991Updated last year
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRAβ1,634Updated last year
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidanceβ2,528Updated 2 months ago
- [AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Sβ¦β912Updated 4 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.β761Updated last year
- Fine-Grained Open Domain Image Animation with Motion Guidanceβ961Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,251Updated 11 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.β1,917Updated 3 months ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Modelsβ946Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024]β1,489Updated 11 months ago
- [CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videosβ¦β978Updated last year
- PixArt-Ξ£: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generationβ1,897Updated last year
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Predictionβ957Updated last year
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoisingβ2,823Updated last year
- Official repository of In-Context LoRA for Diffusion Transformersβ2,057Updated last year
- SEED-Story: Multimodal Long Story Generation with Large Language Modelβ883Updated last year
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation π₯β2,006Updated last year
- CogView4, CogView3-Plus and CogView3(ECCV 2024)β1,105Updated 10 months ago
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Modelsβ701Updated last year
- [ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusionβ776Updated last year
- VideoSys: An easy and efficient system for video generationβ2,015Updated 5 months ago
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translationβ784Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generationβ2,653Updated 11 months ago
- πΉ A more flexible framework that can generate videos at any resolution and creates videos from images.β1,891Updated this week
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]β648Updated last year
- Official implementations for paper: Zero-shot Image Editing with Reference Imitationβ1,305Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion modelsβ3,153Updated last year
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideoβ1,777Updated 8 months ago
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformerβ1,903Updated 7 months ago