PKU-YuanGroup / ConsisID
[CVPR 2025🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
☆609 · Updated this week
Alternatives and similar repositories for ConsisID:
Users who are interested in ConsisID are comparing it to the repositories listed below.
- The official implementation of RealisDance ☆306 · Updated 3 months ago
- Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥 ☆545 · Updated last month
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation ☆454 · Updated 2 months ago
- Codes for ID-Specific Video Customized Diffusion ☆452 · Updated last year
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance ☆219 · Updated 2 weeks ago
- NeurIPS 2024 ☆354 · Updated 5 months ago
- All-round Creator and Editor ☆197 · Updated last month
- [CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation ☆1,077 · Updated this week
- [Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation ☆180 · Updated 7 months ago
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation ☆508 · Updated 5 months ago
- Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks ☆1,092 · Updated this week
- ☆789 · Updated 2 months ago
- Let's finetune video generation models! ☆410 · Updated last week
- 📹 A more flexible CogVideoX that can generate videos at any resolution and create videos from images. ☆650 · Updated 2 months ago
- ☆146 · Updated 9 months ago
- ☆371 · Updated 8 months ago
- [ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation ☆290 · Updated 7 months ago
- Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions" ☆607 · Updated 9 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models ☆888 · Updated last month
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te… ☆1,055 · Updated 3 weeks ago
- A family of versatile and state-of-the-art video tokenizers. ☆346 · Updated last month
- [ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints ☆496 · Updated 2 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences ☆274 · Updated 6 months ago
- ☆431 · Updated 3 months ago
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models ☆206 · Updated last year
- UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization ☆236 · Updated 4 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆721 · Updated 2 months ago
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024) ☆552 · Updated 4 months ago
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation ☆304 · Updated last week