zai-org/Kaleido

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zai-org/Kaleido)

zai-org / Kaleido

Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple image references.

☆140

Alternatives and similar repositories for Kaleido

Users that are interested in Kaleido are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MAGREF-Video / MAGREF
View on GitHub
Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026)
☆298Mar 24, 2026Updated 3 months ago
zai-org / RealVideo
View on GitHub
A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …
☆334Dec 15, 2025Updated 7 months ago
ML-GSAI / Concat-ID
View on GitHub
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
☆65May 7, 2025Updated last year
j0seo / lookahead-anchoring
View on GitHub
☆15Oct 27, 2025Updated 8 months ago
deepshwang / crepa
View on GitHub
☆15Jun 21, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
PKU-YuanGroup / OpenS2V-Nexus
View on GitHub
[NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
☆222May 19, 2026Updated 2 months ago
zai-org / SCAIL-Pose
View on GitHub
Pose Extraction & Rendering for SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representat…
☆226Jun 11, 2026Updated last month
byhuang123 / PoCo
View on GitHub
[CVPR2026] Official implementation of our paper “Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot…
☆19Apr 8, 2026Updated 3 months ago
franciszzj / Saber
View on GitHub
[CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation
☆76Apr 28, 2026Updated 2 months ago
KlingAIResearch / ShotStream
View on GitHub
[ECCV 2026] ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
☆171Jun 23, 2026Updated 3 weeks ago
zli12321 / FFGO-Video-Customization
View on GitHub
Video Content Customization Using First Frame
☆193Mar 17, 2026Updated 4 months ago
kinam0252 / TIC-FT
View on GitHub
☆52Jan 6, 2026Updated 6 months ago
alibaba-damo-academy / Lumos-Custom
View on GitHub
[ICLR-26, ECCV-26, NeurIPS-25] Lumos-Custom Project: research for customized video generation in the Lumos Project.
☆216Jun 29, 2026Updated 3 weeks ago
Phantom-video / HuMo
View on GitHub
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
☆1,274Jan 25, 2026Updated 5 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
KlingAIResearch / MultiShotMaster
View on GitHub
CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"
☆168Feb 22, 2026Updated 5 months ago
Phantom-video / Phantom
View on GitHub
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
☆1,512Sep 11, 2025Updated 10 months ago
KlingAIResearch / CamCloneMaster
View on GitHub
[SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation
☆159Jan 18, 2026Updated 6 months ago
Kevin-thu / StoryMem
View on GitHub
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
☆761May 25, 2026Updated last month
zai-org / SCAIL
View on GitHub
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (CVPR 2026 Findings)
☆1,024May 6, 2026Updated 2 months ago
Guoxu1233 / DreamID-Omni
View on GitHub
[ICML 2026] DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
☆273May 22, 2026Updated 2 months ago
bytedance / BindWeave
View on GitHub
[ICLR 2026] Official Repo For "BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration"
☆338Jan 28, 2026Updated 5 months ago
bytedance-fanqie-ai / MoGA
View on GitHub
Mixture-of-Groups Attention for End-to-End Long Video Generation
☆99Oct 22, 2025Updated 9 months ago
Fantasy-AMAP / fantasy-talking2
View on GitHub
[AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation
☆65Aug 20, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yihao-meng / HoloCine
View on GitHub
[CVPR 2026 Highlight] Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
☆690Nov 26, 2025Updated 7 months ago
showlab / Kiwi-Edit
View on GitHub
A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.
☆305May 13, 2026Updated 2 months ago
zai-org / GLM-Image
View on GitHub
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
☆993Mar 20, 2026Updated 4 months ago
D2I-ai / EchoShot
View on GitHub
☆88Nov 16, 2025Updated 8 months ago
HKUST-C4G / DomainShuttle
View on GitHub
DomainShuttle: Freeform Open Domain Subject-driven Text-to-video Generation
☆162Jun 26, 2026Updated 3 weeks ago
FoundationVision / InfinityStar
View on GitHub
[NeurIPS 2025 Oral]Infinity⭐️: Uniﬁed Spacetime AutoRegressive Modeling for Visual Generation
☆773Apr 16, 2026Updated 3 months ago
alibaba-damo-academy / Lumos
View on GitHub
[ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.
☆161Apr 6, 2026Updated 3 months ago
KlingAIResearch / VideoAlign
View on GitHub
[NeurIPS 2025] Improving Video Generation with Human Feedback
☆483Sep 24, 2025Updated 9 months ago
refkxh / BiCo
View on GitHub
[CVPR 2026 Highlight] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding
☆85May 31, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Phantom-video / OmniInsert
View on GitHub
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models
☆162Mar 4, 2026Updated 4 months ago
Phantom-video / Phantom-Data
View on GitHub
Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset
☆115Feb 25, 2026Updated 4 months ago
FrameX-AI / Stream-R1
View on GitHub
☆54May 6, 2026Updated 2 months ago
vvvvvjdy / dmdr
View on GitHub
[ECCV 2026] Official Code of "Distribution Matching Distillation Meets Reinforcement Learning"
☆282Feb 1, 2026Updated 5 months ago
xyz123xyz456 / hallo4
View on GitHub
☆62Dec 1, 2025Updated 7 months ago
bytedance / Video-As-Prompt
View on GitHub
[ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"
☆439Feb 8, 2026Updated 5 months ago
feizc / Ingredients
View on GitHub
Blending Custom Photos with Video Diffusion Transformers
☆50Jan 21, 2025Updated last year