bytedance / UMO
Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
★179 · Updated 4 months ago
Alternatives and similar repositories for UMO
Users who are interested in UMO are comparing it to the libraries listed below:
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios ★110 · Updated last month
- ★99 · Updated 2 months ago
- Code for CineScale, higher-resolution video generation based on Wan ★183 · Updated 5 months ago
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning ★158 · Updated 4 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control ★190 · Updated last year
- https://little-misfit.github.io/GRAG-Image-Editing/ ★116 · Updated 2 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models ★89 · Updated 4 months ago
- ★79 · Updated 11 months ago
- ★91 · Updated 6 months ago
- Calligrapher: Freestyle Text Image Customization ★295 · Updated 5 months ago
- Official implementation of "Normalized Attention Guidance" ★178 · Updated 7 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement ★285 · Updated 3 weeks ago
- ★113 · Updated 9 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models ★153 · Updated 4 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation ★78 · Updated 5 months ago
- ★326 · Updated 4 months ago
- ★83 · Updated 2 months ago
- OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer ★199 · Updated last week
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ★165 · Updated 7 months ago
- Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association for Group Photo Personalization". This work proposes … ★75 · Updated 9 months ago
- RepText: Rendering Visual Text via Replicating ★141 · Updated 7 months ago
- HunyuanVideo Keyframe Control Lora is an adapter for HunyuanVideo T2V model for keyframe-based video generation ★168 · Updated 10 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models ★184 · Updated 6 months ago
- Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion". ★336 · Updated 2 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling". ★195 · Updated 9 months ago
- Official Repository of "OmniTry: Virtual Try-On Anything without Masks" ★243 · Updated 5 months ago
- Unofficial extension implementation of CausVid ★73 · Updated 9 months ago
- Wan2.2-Lightning: Speed up wan2.2 model with distillation ★266 · Updated 2 months ago
- Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aware bokeh e… ★122 · Updated 6 months ago
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease". ★116 · Updated 9 months ago