donahowe/AutoStudio

[CVPRW 2026] AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

☆450

Alternatives and similar repositories for AutoStudio

Users who are interested in AutoStudio are comparing it to the repositories listed below.

donahowe / TheaterGen
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
☆69 · Sep 26, 2024 · Updated last year
TencentARC / SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
☆884 · Oct 11, 2024 · Updated last year
MyNiuuu / MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
☆765 · Dec 5, 2024 · Updated last year
HVision-NKU / StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
☆6,439 · Sep 26, 2024 · Updated last year
AIGText / Glyph-ByT5
[ECCV 2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and…
☆624 · Sep 5, 2025 · Updated 10 months ago
LPengYang / MotionClone
[ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
☆516 · Jun 17, 2025 · Updated last year
JIA-Lab-research / ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
☆1,644 · Sep 25, 2024 · Updated last year
ID-Animator / ID-Animator
☆383 · Jun 6, 2024 · Updated 2 years ago
aigc-apps / EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
☆2,267 · Mar 6, 2025 · Updated last year
FireRedTeam / StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
☆718 · Dec 2, 2024 · Updated last year
Tencent / MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
☆2,628 · Nov 18, 2025 · Updated 8 months ago
camenduru / FoleyCrafter-jupyter
☆10 · Jun 28, 2024 · Updated 2 years ago
catcathh / UltraPixel
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
☆616 · Sep 27, 2024 · Updated last year
Yuanshi9815 / Video-Infinity
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
☆191 · Aug 4, 2024 · Updated last year
OpenGVLab / Diffree
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
☆239 · May 5, 2025 · Updated last year
filliptm / ComfyUI_FL-Trainer
Train SDXL and SD 1.5
☆177 · Apr 25, 2026 · Updated 2 months ago
aim-uofa / MovieDreamer
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
☆323 · Aug 10, 2024 · Updated last year
Kwai-Kolors / Kolors
Kolors Team
☆4,610 · Nov 13, 2024 · Updated last year
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,247 · Feb 16, 2025 · Updated last year
czg1225 / AsyncDiff
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
☆215 · Sep 27, 2025 · Updated 9 months ago
G-U-N / Phased-Consistency-Model
[NeurIPS 2024] Boosting the performance of consistency models with PCM!
☆521 · Dec 11, 2024 · Updated last year
smthemex / ComfyUI_StoryDiffusion
You can use StoryDiffusion in ComfyUI
☆509 · Oct 11, 2025 · Updated 9 months ago
ali-vilab / MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
☆1,311 · Jun 15, 2024 · Updated 2 years ago
G-U-N / Motion-I2V
[SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
☆190 · Sep 27, 2024 · Updated last year
instantX-research / InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
☆2,011 · Sep 18, 2024 · Updated last year
ToTheBeginning / PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
☆3,548 · Jul 31, 2025 · Updated 11 months ago
MS-Diffusion / MS-Diffusion
[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
☆311 · Jul 30, 2025 · Updated 11 months ago
jeanne-wang / svd_keyframe_interpolation
☆296 · Aug 30, 2024 · Updated last year
qinghew / CharacterFactory
[TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥
☆222 · Feb 9, 2026 · Updated 5 months ago
zeroxoxo / ComfyUI-Fast-Style-Transfer
ComfyUI node for fast neural style transfer
☆74 · Apr 7, 2025 · Updated last year
open-mmlab / Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
☆199 · Jul 22, 2024 · Updated last year
TencentQQGYLab / ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
☆1,285 · Jul 17, 2024 · Updated 2 years ago
agentic-learning-ai-lab / procreate-diffusion
Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"
☆43 · Jun 7, 2026 · Updated last month
KlingAIResearch / I2V-Adapter
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
☆231 · Jun 18, 2024 · Updated 2 years ago
Boese0601 / MagicDance
[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
☆775 · Jul 3, 2024 · Updated 2 years ago
MC-E / ReVideo
NeurIPS 2024
☆395 · Sep 26, 2024 · Updated last year
TianxingWu / FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
☆544 · Jan 18, 2024 · Updated 2 years ago
TMElyralab / MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
☆2,693 · Mar 5, 2025 · Updated last year
hay86 / ComfyUI_Hallo
Unofficial implementation of Hallo in ComfyUI
☆21 · Jul 30, 2024 · Updated last year