Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple image references.
☆116Dec 23, 2025Updated 2 months ago
Alternatives and similar repositories for Kaleido
Users that are interested in Kaleido are comparing it to the libraries listed below
Sorting:
- Pose Extraction & Rendering for SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representat…☆180Dec 28, 2025Updated 2 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- ☆71Nov 24, 2025Updated 3 months ago
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation☆69Feb 12, 2026Updated 2 weeks ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆117Dec 17, 2025Updated 2 months ago
- ☆175Nov 8, 2025Updated 3 months ago
- Camera Angle lora for Qwen Image Edit 2511☆95Jan 10, 2026Updated last month
- ☆14Feb 20, 2024Updated 2 years ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026)☆287Updated this week
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆100Oct 3, 2025Updated 5 months ago
- [NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation☆101Feb 12, 2026Updated 2 weeks ago
- Structured Noise Generation from the paper NeuralRemaster with Phase-Preserving Diffusion☆31Feb 1, 2026Updated last month
- ☆124Jun 17, 2025Updated 8 months ago
- [AAAI2026] Implementation Code for Omni-Effects☆173Dec 9, 2025Updated 2 months ago
- A local-first, high-performance desktop asset manager for AI image generations. Features universal metadata parsing (ComfyUI/A1111), inst…☆55Updated this week
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Nov 20, 2025Updated 3 months ago
- [TBench 2024] Official implementation of "AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI"☆48Jan 30, 2024Updated 2 years ago
- ☆279Jan 8, 2026Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model☆671Feb 13, 2026Updated 2 weeks ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆66May 7, 2025Updated 9 months ago
- ☆43Sep 1, 2025Updated 6 months ago
- ☆63Jan 28, 2026Updated last month
- Lynx: Towards High-Fidelity Personalized Video Generation☆309Sep 26, 2025Updated 5 months ago
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆721Nov 27, 2025Updated 3 months ago
- AnyTalker: Scaling Multi-person Talking Video Generation with Interactivity Refinement☆278Dec 5, 2025Updated 2 months ago
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory☆655Jan 22, 2026Updated last month
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆500Aug 20, 2025Updated 6 months ago
- Repo for SeedVR2 (ICLR2026) & SeedVR (CVPR2025 Highlight)☆1,051Jan 27, 2026Updated last month
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- I wanted a node to save my prompts and optionally take in an external prompt from a llm and save it, didn't see one, so I made it.☆46Jan 18, 2026Updated last month
- Official repository for code and information related to the HumanOLAT dataset (ICCV 2025).☆38Nov 17, 2025Updated 3 months ago
- Terminal Velocity Matching☆67Feb 14, 2026Updated 2 weeks ago
- Allows native usage of ModelScope based Text To Video Models in ComfyUI☆27May 23, 2024Updated last year
- [ICLR 2025] Animate-X: Universal Character Image Animation with Enhanced Motion Representation☆381Sep 17, 2025Updated 5 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆671Oct 14, 2025Updated 4 months ago
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation☆30Mar 29, 2024Updated last year
- StreamDiffusion, Live Stream APP☆360Feb 18, 2026Updated last week
- Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction☆186Jan 14, 2026Updated last month
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆392Jan 19, 2025Updated last year