Phantom-video / Phantom
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
☆820 · Updated 2 weeks ago
Alternatives and similar repositories for Phantom:
Users who are interested in Phantom are comparing it to the repositories listed below.
- 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning ☆967 · Updated 3 weeks ago
- ☆812 · Updated 2 weeks ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers ☆492 · Updated last week
- Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆1,461 · Updated 2 weeks ago
- FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆934 · Updated last week
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images. ☆962 · Updated last week
- ☆740 · Updated 2 months ago
- Motion-Controllable Video Diffusion via Warped Noise ☆879 · Updated last month
- ☆1,051 · Updated 2 weeks ago
- [Official] Image editing is worth a single LoRA! 0.1% training data and 1% training parameters for fantastic image editing! Surpasses GPT… ☆633 · Updated this week
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework ☆683 · Updated 2 weeks ago
- ☆416 · Updated last week
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… ☆1,111 · Updated this week
- Enhance-A-Video: Better Generated Video for Free ☆519 · Updated last month
- A pipeline parallel training script for diffusion models. ☆972 · Updated this week
- ☆520 · Updated 3 months ago
- Illumination Drawing Tools for Text-to-Image Diffusion Models ☆733 · Updated 4 months ago
- Official repo for CFG-Zero* ☆530 · Updated this week
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,380 · Updated 3 weeks ago
- [ICLR 2025] Animate-X: Universal Character Image Animation with Enhanced Motion Representation ☆285 · Updated 2 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model ☆751 · Updated 2 weeks ago
- ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., au… ☆246 · Updated 2 weeks ago
- Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention" ☆505 · Updated 9 months ago
- Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on" ☆528 · Updated 3 months ago
- ☆405 · Updated 6 months ago
- Official repository of In-Context LoRA for Diffusion Transformers ☆1,838 · Updated 4 months ago
- ☆612 · Updated this week
- Memory-Guided Diffusion for Expressive Talking Video Generation ☆813 · Updated 3 months ago
- SkyReels-A2: Compose anything in video diffusion transformers ☆482 · Updated 2 weeks ago
- [ICLR 2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animation ☆366 · Updated 3 months ago