bytedance / UMOLinks
🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
☆176Updated 3 months ago
Alternatives and similar repositories for UMO
Users that are interested in UMO are comparing it to the libraries listed below
Sorting:
- ☆96Updated last month
- Code for CineScale, higher-resolution video generation based on Wan☆179Updated 4 months ago
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning☆158Updated 3 months ago
- ☆112Updated 8 months ago
- ☆91Updated 5 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆114Updated 3 weeks ago
- ☆77Updated last month
- ☆79Updated 9 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement☆281Updated 2 months ago
- Official implementation of "Normalized Attention Guidance"☆175Updated 5 months ago
- Calligrapher: Freestyle Text Image Customization☆295Updated 3 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆164Updated 5 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆181Updated 5 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆90Updated 3 months ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆188Updated 11 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆85Updated last week
- Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association for Group Photo Personalization". This work proposed propose …☆73Updated 7 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆144Updated 3 months ago
- ☆316Updated 3 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".☆194Updated 8 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆141Updated 6 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆77Updated 4 months ago
- HunyuanVideo Keyframe Control Lora is an adapter for HunyuanVideo T2V model for keyframe-based video generation☆167Updated 9 months ago
- Official Repository of "OmniTry: Virtual Try-On Anything without Masks"☆233Updated 3 months ago
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With …☆81Updated last year
- Official code for our ICCV2025 paper "SDMatte: Grafting Diffusion Models for Interactive Matting"☆165Updated 4 months ago
- Wan2.2-Lightning: Speed up wan2.2 model with distillation☆245Updated last month
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆624Updated last month
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆539Updated last month
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆417Updated 6 months ago