Stability-AI / StableCascade
Official Code for Stable Cascade
☆6,547Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for StableCascade
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,006Updated 7 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆2,971Updated 3 weeks ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆5,283Updated 4 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,652Updated 4 months ago
- [WIP] Layer Diffusion for WebUI (via Forge)☆3,885Updated 2 months ago
- StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, …☆4,588Updated 3 months ago
- Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on☆5,607Updated 6 months ago
- Official implementation of AnimateDiff.☆10,603Updated 3 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,185Updated 5 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,589Updated 2 months ago
- ☆4,579Updated 3 months ago
- An intuitive GUI for GLIGEN that uses ComfyUI in the backend☆2,023Updated 8 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆5,955Updated last month
- Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.☆6,919Updated this week
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,383Updated 5 months ago
- An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc…☆2,371Updated last week
- Fast stable diffusion on CPU☆1,497Updated 2 weeks ago
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆9,741Updated 3 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,121Updated 4 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,693Updated last month
- Official implementation of DreaMoving☆1,796Updated 10 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆2,809Updated 3 weeks ago
- Improved AnimateDiff for ComfyUI and Advanced Sampling Support☆2,761Updated this week
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆7,645Updated 4 months ago
- Mora: More like Sora for Generalist Video Generation☆1,517Updated last month
- Transparent Image Layer Diffusion using Latent Transparency☆2,023Updated 5 months ago
- More relighting!☆5,545Updated 3 weeks ago
- [ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild☆3,935Updated 2 weeks ago
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model☆10,485Updated 4 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,086Updated 3 months ago