bytedance / DreamOLinks
[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization
☆1,733Updated 4 months ago
Alternatives and similar repositories for DreamO
Users that are interested in DreamO are comparing it to the libraries listed below
Sorting:
- Official Repo For "BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration"☆354Updated 3 weeks ago
- [NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent☆725Updated last week
- PixelHacker: Image Inpainting with Structural and Semantic Consistency☆464Updated 6 months ago
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,338Updated 3 months ago
- SkyReels-A2: Compose anything in video diffusion transformers☆691Updated 6 months ago
- ☆1,044Updated 7 months ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,462Updated 3 months ago
- [NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ …☆2,046Updated last month
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,601Updated 3 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,196Updated 2 months ago
- We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven av…☆1,154Updated last week
- ☆779Updated 5 months ago
- RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.☆270Updated last week
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer☆1,862Updated 5 months ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆689Updated 3 months ago
- ☆753Updated 10 months ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,020Updated 2 months ago
- HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.☆1,279Updated 2 months ago
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆1,967Updated 2 weeks ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,748Updated 6 months ago
- ☆1,338Updated 7 months ago
- Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference☆1,222Updated last month
- ☆1,960Updated 2 months ago
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation☆2,584Updated last month
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,498Updated 2 months ago
- ☆1,547Updated this week
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework☆839Updated last month
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆490Updated 3 months ago
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No…☆1,052Updated 2 months ago
- A pipeline parallel training script for diffusion models.☆1,769Updated 2 weeks ago