bytedance / UMOLinks
π₯π₯ Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
β144Updated 2 weeks ago
Alternatives and similar repositories for UMO
Users that are interested in UMO are comparing it to the libraries listed below
Sorting:
- β93Updated 3 months ago
- β79Updated 7 months ago
- β87Updated 2 months ago
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learningβ141Updated 2 weeks ago
- Code for CineScale, higher-resolution video generation based on Wanβ158Updated last month
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editingβ157Updated 3 months ago
- β110Updated 5 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Controlβ188Updated 9 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion modelsβ88Updated 3 weeks ago
- [CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generationβ130Updated last month
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Modelsβ105Updated last week
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".β106Updated 5 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Modelsβ177Updated 2 months ago
- [ICCV 2025] LayerAnimate: Layer-specific Control for Animationβ188Updated last month
- RepText: Rendering Visual Text via Replicating π₯β138Updated 3 months ago
- β103Updated last month
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generationβ72Updated last month
- Unofficial extension implementation of CausVidβ58Updated 5 months ago
- Pytorch implementation of Towards Consistent and Controllable Image Synthesis for Face Editingβ62Updated 5 months ago
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With β¦β81Updated 10 months ago
- HunyuanVideo Keyframe Control Lora is an adapter for HunyuanVideo T2V model for keyframe-based video generationβ163Updated 6 months ago
- Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association for Group Photo Personalization". This work proposed propose β¦β70Updated 5 months ago
- Ofiicial GoodDrag implementation.β96Updated last week
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"β84Updated 3 months ago
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedbackβ109Updated 2 weeks ago
- Keyframe Interpolation with CogvideoXβ137Updated 11 months ago
- The official implementation of βRepVideo: Rethinking Cross-Layer Representation for Video Generationββ119Updated 8 months ago
- Calligrapher: Freestyle Text Image Customizationβ291Updated last month
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting withβ¦β147Updated last month
- Official implementation of "Normalized Attention Guidance"β166Updated 3 months ago