[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
☆510Mar 7, 2024Updated last year
Alternatives and similar repositories for ScaleCrafter
Users that are interested in ScaleCrafter are comparing it to the libraries listed below
Sorting:
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆427Aug 25, 2025Updated 6 months ago
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,895Dec 24, 2024Updated last year
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆501Nov 14, 2023Updated 2 years ago
- [ICCV 2023] Consistent Image Synthesis and Editing☆837Aug 19, 2024Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆545Jan 18, 2024Updated 2 years ago
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆313Apr 29, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- Consistency Distilled Diff VAE☆2,209Nov 7, 2023Updated 2 years ago
- Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"☆359Dec 8, 2023Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆312Nov 1, 2024Updated last year
- ✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL☆1,113Jan 23, 2024Updated 2 years ago
- InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)☆1,386Jun 7, 2024Updated last year
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.☆1,040Aug 21, 2024Updated last year
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆948Nov 13, 2024Updated last year
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images☆506Oct 7, 2025Updated 4 months ago
- The official Pytorch Implementation for ElasticDiffusion: Training-free Arbitrary Size Image Generation through Global-Local Content Sepa…☆159Dec 24, 2024Updated last year
- Concept Sliders for Precise Control of Diffusion Models☆1,129Jun 20, 2025Updated 8 months ago
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆521Apr 2, 2024Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,032Jan 9, 2026Updated last month
- Retrieval-Augmented Video Generation for Telling a Story☆259Feb 5, 2024Updated 2 years ago
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!☆838Jan 7, 2026Updated last month
- Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆394May 5, 2024Updated last year
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,252Feb 16, 2025Updated last year
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,615Jun 14, 2024Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆758Nov 16, 2023Updated 2 years ago
- [AAAI 2025] Official codes of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".☆768Apr 27, 2025Updated 10 months ago
- Segmind Distilled diffusion☆619Oct 18, 2023Updated 2 years ago
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆506Jul 2, 2024Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,994Sep 8, 2024Updated last year
- ☆446Mar 24, 2024Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,494Feb 19, 2025Updated last year
- ICLR 2024 (Spotlight)☆785Mar 2, 2024Updated 2 years ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,844Feb 1, 2025Updated last year
- Mixture of Diffusers for scene composition and high resolution image generation☆447May 21, 2023Updated 2 years ago
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆504Nov 16, 2024Updated last year
- ☆62Jun 25, 2024Updated last year
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆534Sep 8, 2025Updated 5 months ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,058Sep 21, 2023Updated 2 years ago