Phantom-video / Phantom
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
⭐ 1,056 · Updated this week
Alternatives and similar repositories for Phantom
Users interested in Phantom are comparing it to the repositories listed below.
- 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning · ⭐ 1,075 · Updated last month
- ⭐ 972 · Updated 2 weeks ago
- FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis · ⭐ 1,288 · Updated 2 weeks ago
- 📹 A more flexible framework that can generate videos at any resolution and create videos from images. · ⭐ 1,032 · Updated this week
- ⭐ 1,137 · Updated last month
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers · ⭐ 511 · Updated last month
- ⭐ 745 · Updated 3 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation · ⭐ 962 · Updated 2 weeks ago
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework · ⭐ 714 · Updated this week
- ⭐ 783 · Updated 6 months ago
- A pipeline parallel training script for diffusion models. · ⭐ 1,086 · Updated this week
- Enhance-A-Video: Better Generated Video for Free · ⭐ 526 · Updated 2 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo · ⭐ 1,460 · Updated last week
- Illumination Drawing Tools for Text-to-Image Diffusion Models · ⭐ 761 · Updated 3 weeks ago
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No…" · ⭐ 905 · Updated this week
- Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persisten… · ⭐ 1,565 · Updated 2 weeks ago
- Official repository of In-Context LoRA for Diffusion Transformers · ⭐ 1,885 · Updated 5 months ago
- A minimal and universal controller for FLUX.1. · ⭐ 1,587 · Updated 2 weeks ago
- ⭐ 508 · Updated last month
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… · ⭐ 1,326 · Updated this week
- ⭐ 704 · Updated 6 months ago
- Official implementations for paper: VACE: All-in-One Video Creation and Editing · ⭐ 2,273 · Updated 2 weeks ago
- ⭐ 403 · Updated this week
- ⭐ 520 · Updated 4 months ago
- ⭐ 685 · Updated this week
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model · ⭐ 833 · Updated last week
- ⭐ 1,488 · Updated 3 months ago
- ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., au… · ⭐ 273 · Updated last month
- ⭐ 559 · Updated this week
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi… · ⭐ 794 · Updated last week