RedAIGC / StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
☆648Updated 2 months ago
Alternatives and similar repositories for StoryMaker:
Users who are interested in StoryMaker are comparing it to the libraries listed below.
- ☆462Updated 2 months ago
- AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation☆423Updated 3 months ago
- ☆585Updated 2 months ago
- [ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and…☆545Updated 7 months ago
- Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"☆422Updated last week
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models☆682Updated 7 months ago
- ☆406Updated 4 months ago
- Official repository of In-Context LoRA for Diffusion Transformers☆1,581Updated last month
- Stable-Hair: Real-World Hair Transfer via Diffusion Model (AAAI 2025)☆412Updated 3 months ago
- The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"☆508Updated last month
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!☆799Updated 2 months ago
- [ICLR2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animation☆326Updated 3 weeks ago
- ☆475Updated 3 weeks ago
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation☆752Updated 8 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆712Updated 2 months ago
- Illumination Drawing Tools for Text-to-Image Diffusion Models☆529Updated last month
- ☆384Updated 3 months ago
- [Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive …☆359Updated 5 months ago
- Training-free Regional Prompting for Diffusion Transformers 🔥☆551Updated 2 months ago
- ☆390Updated 2 weeks ago
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)☆546Updated 3 months ago
- 📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.☆640Updated 2 months ago
- [ICLR 2025] Animate-X - PyTorch Implementation☆302Updated 3 weeks ago
- Official implementation of "AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models"☆271Updated last month
- SEED-Story: Multimodal Long Story Generation with Large Language Model☆790Updated 4 months ago
- This is a study that aims to transfer a single concept using the DiT model's self-attention capability☆632Updated 2 months ago
- ☆367Updated 8 months ago
- DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework☆325Updated 2 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆712Updated 3 weeks ago
- The Dawn of Video Generation: Preliminary Explorations with SORA-like Models☆206Updated last month