PKU-YuanGroup / MagicTimeLinks
[TPAMI 2025π₯] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
β1,342Updated 6 months ago
Alternatives and similar repositories for MagicTime
Users that are interested in MagicTime are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"β1,705Updated last year
- Customized ID Consistent for humanβ1,020Updated last month
- [ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidanceβ4,245Updated last year
- Unofficial Implementation of Animate Anyoneβ2,933Updated last year
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diο¬usion Models for Consistent Human Image Animation".β1,180Updated 9 months ago
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.β1,039Updated last year
- [CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generationβ1,228Updated 6 months ago
- β900Updated last year
- [ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animationβ3,671Updated 11 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Modelsβ913Updated 10 months ago
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformerβ1,361Updated 10 months ago
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generationβ1,134Updated 4 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple teβ¦β1,112Updated 11 months ago
- SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditionsβ653Updated last year
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement π₯β620Updated last month
- [CVPR 2025 Highlightπ₯] Identity-Preserving Text-to-Video Generation by Frequency Decompositionβ803Updated 5 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)β1,843Updated last year
- [NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"β1,084Updated last year
- Video generation from text&image, 1st-genβ919Updated 8 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion modelsβ3,151Updated last year
- The official implementation of RealisDanceβ608Updated 7 months ago
- [SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Dataβ659Updated last year
- β726Updated last year
- Official Pytorch implementation of StreamV2V.β531Updated last month
- Memory-Guided Diffusion for Expressive Talking Video Generationβ1,076Updated 5 months ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Textβ1,625Updated 10 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,248Updated 11 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignmentβ1,274Updated last year
- Fine-Grained Open Domain Image Animation with Motion Guidanceβ960Updated last year
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation π₯β2,004Updated last year