AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
☆449Apr 13, 2025Updated 11 months ago
Alternatives and similar repositories for AutoStudio
Users that are interested in AutoStudio are comparing it to the libraries listed below
Sorting:
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆68Sep 26, 2024Updated last year
- SEED-Story: Multimodal Long Story Generation with Large Language Model☆887Oct 11, 2024Updated last year
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆768Dec 5, 2024Updated last year
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,401Sep 26, 2024Updated last year
- [ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and…☆623Sep 5, 2025Updated 6 months ago
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation☆514Jun 17, 2025Updated 9 months ago
- ☆387Jun 6, 2024Updated last year
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,636Sep 25, 2024Updated last year
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,252Mar 6, 2025Updated last year
- StoryMaker: Towards consistent characters in text-to-image generation☆722Dec 2, 2024Updated last year
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model☆240May 5, 2025Updated 10 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,546Nov 18, 2025Updated 4 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Aug 4, 2024Updated last year
- ☆11Jun 28, 2024Updated last year
- Train SDXL and SD 1.5☆173Oct 18, 2024Updated last year
- Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks☆614Sep 27, 2024Updated last year
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆212Sep 27, 2025Updated 5 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆322Aug 10, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- Kolors Team☆4,608Nov 13, 2024Updated last year
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆514Dec 11, 2024Updated last year
- Official implementations for paper: Zero-shot Image Editing with Reference Imitation☆1,310Jun 15, 2024Updated last year
- ComfyUI node for fast neural style transfer☆76Apr 7, 2025Updated 11 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆2,006Sep 18, 2024Updated last year
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,528Jul 31, 2025Updated 7 months ago
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"☆41Nov 30, 2025Updated 3 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling☆190Sep 27, 2024Updated last year
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆200Jul 22, 2024Updated last year
- [TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥☆221Feb 9, 2026Updated last month
- ☆295Aug 30, 2024Updated last year
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆309Jul 30, 2025Updated 7 months ago
- NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models☆428May 14, 2024Updated last year
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,281Jul 17, 2024Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆545Jan 18, 2024Updated 2 years ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,660Mar 5, 2025Updated last year
- NeurIPS 2024☆395Sep 26, 2024Updated last year
- [ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion☆777Jul 3, 2024Updated last year
- ☆40Jun 30, 2024Updated last year
- I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models☆230Jun 18, 2024Updated last year