bytedance / USOLinks
🔥🔥 Open-sourced unified customization model
☆236Updated this week
Alternatives and similar repositories for USO
Users that are interested in USO are comparing it to the libraries listed below
Sorting:
- Official GitHub repository for FLUX.1 Krea [dev].☆334Updated last month
- Qwen-Image-Lightning: Speed up Qwen-Image model with distillation☆537Updated this week
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.☆405Updated last week
- [Official] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆302Updated 2 weeks ago
- In-context subject-driven image generation while preserving foreground fidelity☆348Updated 2 months ago
- ObjectClear: Complete Object Removal via Object-Effect Attention☆436Updated 2 weeks ago
- ☆768Updated last month
- Calligrapher: Freestyle Text Image Customization☆280Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model☆597Updated last week
- An inference and training framework for multiple image input in Flux Kontext dev☆371Updated last week
- ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio☆59Updated this week
- HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.☆185Updated this week
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆558Updated this week
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆396Updated 2 months ago
- ☆1,025Updated 3 months ago
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework☆782Updated 2 months ago
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆381Updated 2 weeks ago
- Official repository for "Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment"☆720Updated last week
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆257Updated 2 months ago
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆273Updated last month
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆145Updated last week
- Custom ComfyUI nodes for our community☆113Updated 2 months ago
- ☆752Updated 6 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆415Updated 2 weeks ago
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi…☆300Updated 2 weeks ago
- Streamlining Cartoon Production with Generative Post-Keyframing☆384Updated 2 weeks ago
- 4Bit Quantized Model for HiDream I1☆245Updated 3 months ago
- Community trainer for Lightricks' LTX Video model 🎬 ⚡️☆303Updated last month
- [SIGGRAPH 2025] Official code of the paper "Cobra: Efficient Line Art COlorization with BRoAder References". Cobra:利用更广泛参考图实现高效线稿上色☆209Updated 4 months ago
- [ICCV 2025] Official pytorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"☆394Updated 5 months ago