bytedance / UMOLinks
🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
☆176Updated 3 months ago
Alternatives and similar repositories for UMO
Users that are interested in UMO are comparing it to the libraries listed below
Sorting:
- ☆97Updated 2 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆103Updated 3 weeks ago
- Code for CineScale, higher-resolution video generation based on Wan☆182Updated 4 months ago
- ☆79Updated 10 months ago
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning☆158Updated 3 months ago
- ☆113Updated 8 months ago
- ☆78Updated last month
- ☆91Updated 6 months ago
- Official implementation of "Normalized Attention Guidance"☆175Updated 6 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆148Updated 3 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆89Updated 4 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆116Updated last month
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆189Updated last year
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement☆281Updated 3 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆141Updated 7 months ago
- HunyuanVideo Keyframe Control Lora is an adapter for HunyuanVideo T2V model for keyframe-based video generation☆167Updated 9 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆165Updated 6 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆181Updated 5 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".☆193Updated 9 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆77Updated 4 months ago
- Calligrapher: Freestyle Text Image Customization☆294Updated 4 months ago
- [ICCV 2025] LayerAnimate: Layer-specific Control for Animation☆192Updated 4 months ago
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With …☆81Updated last year
- Official Repository of "OmniTry: Virtual Try-On Anything without Masks"☆237Updated 4 months ago
- ☆321Updated 3 months ago
- Official code for our ICCV2025 paper "SDMatte: Grafting Diffusion Models for Interactive Matting"☆168Updated 4 months ago
- Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association for Group Photo Personalization". This work proposed propose …☆75Updated 8 months ago
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆169Updated 8 months ago
- ☆106Updated 4 months ago
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis)☆118Updated 4 months ago