Picsart-AI-Research / StreamingT2V
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
★1,627 · Updated 10 months ago
Alternatives and similar repositories for StreamingT2V
Users who are interested in StreamingT2V are comparing it to the repositories listed below.
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ★2,249 · Updated 11 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ★2,991 · Updated last year
- [AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via S… ★911 · Updated 4 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ★2,524 · Updated 2 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ★2,251 · Updated 11 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ★1,634 · Updated last year
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation. ★1,917 · Updated 3 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ★761 · Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024] ★1,489 · Updated 11 months ago
- Fine-Grained Open Domain Image Animation with Motion Guidance ★961 · Updated last year
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models ★946 · Updated last year
- [CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos… ★978 · Updated last year
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation ★1,897 · Updated last year
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction ★957 · Updated last year
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising ★2,823 · Updated last year
- Official implementations for paper: Zero-shot Image Editing with Reference Imitation ★1,306 · Updated last year
- SEED-Story: Multimodal Long Story Generation with Large Language Model ★883 · Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation ★2,653 · Updated 11 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ★1,843 · Updated last year
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ★2,006 · Updated last year
- Official Pytorch implementation of StreamV2V. ★532 · Updated last month
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models ★701 · Updated last year
- VideoSys: An easy and efficient system for video generation ★2,015 · Updated 5 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ★3,275 · Updated last year
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images. ★1,891 · Updated this week
- Official repository of In-Context LoRA for Diffusion Transformers ★2,057 · Updated last year
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ★3,154 · Updated last year
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation ★783 · Updated last year
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling" ★1,573 · Updated 7 months ago
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion … ★1,605 · Updated last year