Phantom-video / Phantom
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
☆1,328 · Updated last month
Alternatives and similar repositories for Phantom
Users who are interested in Phantom are comparing it to the repositories listed below.
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning ☆1,184 · Updated 3 months ago
- ☆1,014 · Updated 2 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation ☆1,137 · Updated last month
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆1,466 · Updated last week
- ☆750 · Updated 5 months ago
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images. ☆1,233 · Updated this week
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers ☆554 · Updated last month
- ☆717 · Updated 2 weeks ago
- ☆1,741 · Updated last month
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,596 · Updated 2 months ago
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework ☆755 · Updated last month
- Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆3,011 · Updated 2 months ago
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer ☆1,707 · Updated last month
- A pipeline parallel training script for diffusion models. ☆1,310 · Updated 2 weeks ago
- [ICCV'25 Oral] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video ☆1,348 · Updated last week
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No… ☆986 · Updated last month
- Wan: Open and Advanced Large-Scale Video Generative Models ☆1,645 · Updated this week
- Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persisten… ☆1,856 · Updated 2 months ago
- Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation ☆1,904 · Updated 3 weeks ago
- Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation" ☆834 · Updated 5 months ago
- Enhance-A-Video: Better Generated Video for Free ☆561 · Updated 4 months ago
- SkyReels-A2: Compose anything in video diffusion transformers ☆638 · Updated 2 months ago
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… ☆1,549 · Updated this week
- ☆1,246 · Updated 3 months ago
- ☆747 · Updated 8 months ago
- Official repository of In-Context LoRA for Diffusion Transformers ☆1,980 · Updated 7 months ago
- ☆520 · Updated 6 months ago
- ☆816 · Updated this week
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆751 · Updated 7 months ago
- Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on" ☆575 · Updated 5 months ago