Picsart-AI-Research / StreamingT2VLinks
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
β1,625Updated 10 months ago
Alternatives and similar repositories for StreamingT2V
Users that are interested in StreamingT2V are comparing it to the libraries listed below
Sorting:
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priorsβ2,983Updated last year
- πΊ An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusionβ2,246Updated 10 months ago
- [AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Sβ¦β909Updated 4 months ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.β1,903Updated 2 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRAβ1,630Updated last year
- [CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videosβ¦β977Updated last year
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.β760Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,248Updated 11 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidanceβ2,515Updated 2 months ago
- Fine-Grained Open Domain Image Animation with Motion Guidanceβ960Updated last year
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Predictionβ957Updated last year
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Modelsβ947Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024]β1,489Updated 11 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation π₯β2,005Updated last year
- VideoSys: An easy and efficient system for video generationβ2,016Updated 5 months ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generationβ2,649Updated 10 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion modelsβ3,151Updated last year
- PixArt-Ξ£: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generationβ1,892Updated last year
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Modelsβ701Updated last year
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translationβ783Updated last year
- SEED-Story: Multimodal Long Story Generation with Large Language Modelβ881Updated last year
- [ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusionβ772Updated last year
- Official implementations for paper: Zero-shot Image Editing with Reference Imitationβ1,304Updated last year
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]β643Updated last year
- Official Pytorch implementation of StreamV2V.β531Updated 3 weeks ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoisingβ2,815Updated last year
- Character Animation (AnimateAnyone, Face Reenactment)β3,479Updated last year
- Official repository of In-Context LoRA for Diffusion Transformersβ2,049Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)β1,843Updated 11 months ago
- Transparent Image Layer Diffusion using Latent Transparencyβ2,187Updated last year