[CVPRW 2026] AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
☆447Apr 13, 2025Updated last year
Alternatives and similar repositories for AutoStudio
Users that are interested in AutoStudio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆68Sep 26, 2024Updated last year
- SEED-Story: Multimodal Long Story Generation with Large Language Model☆883Oct 11, 2024Updated last year
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆767Dec 5, 2024Updated last year
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,410Sep 26, 2024Updated last year
- [ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and…☆623Sep 5, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation☆513Jun 17, 2025Updated 10 months ago
- ☆387Jun 6, 2024Updated last year
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,645Sep 25, 2024Updated last year
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,264Mar 6, 2025Updated last year
- StoryMaker: Towards consistent characters in text-to-image generation☆717Dec 2, 2024Updated last year
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model☆240May 5, 2025Updated 11 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,569Nov 18, 2025Updated 5 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Aug 4, 2024Updated last year
- ☆10Jun 28, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Train SDXL and SD 1.5☆175Updated this week
- Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks☆616Sep 27, 2024Updated last year
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆214Sep 27, 2025Updated 7 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆323Aug 10, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- Kolors Team☆4,614Nov 13, 2024Updated last year
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆516Dec 11, 2024Updated last year
- Official implementations for paper: Zero-shot Image Editing with Reference Imitation☆1,309Jun 15, 2024Updated last year
- ComfyUI node for fast neural style transfer☆76Apr 7, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆2,007Sep 18, 2024Updated last year
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,539Jul 31, 2025Updated 9 months ago
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"☆41Nov 30, 2025Updated 5 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling☆191Sep 27, 2024Updated last year
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆200Jul 22, 2024Updated last year
- [TIP 2025] CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models 🔥☆222Feb 9, 2026Updated 2 months ago
- ☆295Aug 30, 2024Updated last year
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆311Jul 30, 2025Updated 9 months ago
- NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models☆430May 14, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆1,284Jul 17, 2024Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆544Jan 18, 2024Updated 2 years ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,667Mar 5, 2025Updated last year
- [ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion☆778Jul 3, 2024Updated last year
- NeurIPS 2024☆397Sep 26, 2024Updated last year
- I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models☆232Jun 18, 2024Updated last year
- ☆40Jun 30, 2024Updated last year