bytedance / USOLinks
π₯π₯ Open-sourced unified customization model
β1,201Updated 5 months ago
Alternatives and similar repositories for USO
Users that are interested in USO are comparing it to the libraries listed below
Sorting:
- [ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recyclingβ2,025Updated 3 weeks ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioningβ1,133Updated 2 weeks ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generationββ672Updated 3 months ago
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillationβ1,219Updated last month
- β2,053Updated last month
- Official inference repo for FLUX.2 modelsβ1,762Updated 3 weeks ago
- Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narrativesβ627Updated 2 months ago
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.β886Updated 5 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Offβ333Updated 3 months ago
- ObjectClear: Complete Object Removal via Object-Effect Attentionβ532Updated 2 months ago
- Official GitHub repository for FLUX.1 Krea [dev].β360Updated 6 months ago
- β787Updated 6 months ago
- ComfyUI node for highly expressive speech and realistic zero-shot voice cloningβ381Updated last month
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generationβ2,827Updated last week
- Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"β1,750Updated 2 weeks ago
- ComfyDeployedβ441Updated 4 months ago
- β1,592Updated 2 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Datasetβ566Updated 3 months ago
- MoCha: End-to-End Video Character Replacement without Structural Guidanceβ635Updated 3 weeks ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.β725Updated last month
- β716Updated 3 months ago
- [ICCV 2025] π₯π₯ UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioningβ1,350Updated 5 months ago
- PersonaLive! : Expressive Portrait Image Animation for Live Streamingβ1,612Updated last month
- Pusa: Thousands Timesteps Video Diffusion Modelβ672Updated last week
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generationβ1,204Updated 3 months ago
- Lumina-Image 2.0: A Unified and Efficient Image Generative Frameworkβ859Updated 3 months ago
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.β427Updated 5 months ago
- Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representationsβ826Updated last month
- Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion".β337Updated 2 months ago
- β1,046Updated 8 months ago