zai-org / KaleidoLinks
Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple image references.
☆105Updated 3 weeks ago
Alternatives and similar repositories for Kaleido
Users that are interested in Kaleido are comparing it to the libraries listed below
Sorting:
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning☆158Updated 4 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆77Updated 4 months ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆43Updated 2 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆165Updated 6 months ago
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback☆121Updated 3 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆103Updated 3 weeks ago
- ☆79Updated 10 months ago
- Code for CineScale, higher-resolution video generation based on Wan☆182Updated 4 months ago
- Generate image at any resolution.☆43Updated 3 months ago
- 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward☆176Updated 4 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆181Updated 5 months ago
- DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework☆141Updated 5 months ago
- A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design☆139Updated 7 months ago
- Code for full fintuing Mochi model with FSDP (and CP)☆30Updated 5 months ago
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis.☆65Updated last year
- [ICLR 2025] Official lmplementation of SPM-Diff: Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On☆47Updated 10 months ago
- Unofficial extension implementation of CausVid☆73Updated 8 months ago
- [ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation☆201Updated 7 months ago
- ☆132Updated 6 months ago
- [NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation☆90Updated last week
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆65Updated 8 months ago
- ☆97Updated 2 months ago
- Blending Custom Photos with Video Diffusion Transformers☆48Updated 11 months ago
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆116Updated 8 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆141Updated 7 months ago
- An official implementation of SwapAnyone.☆72Updated 10 months ago
- StreamDiffusion, Live Stream APP☆312Updated 2 weeks ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback☆208Updated 3 weeks ago
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing"☆75Updated last month
- Code of RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images☆93Updated last year