yandex-research / switti
The code and models for the paper: Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
☆160Updated last month
Alternatives and similar repositories for switti:
Users that are interested in switti are comparing it to the libraries listed below
- Text and image to video generation: Kandinsky 4.0 (2024)☆143Updated 2 months ago
- Official Implementation of weights2weights☆138Updated 2 months ago
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First☆96Updated 5 months ago
- [NeurIPS'2024] Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps☆90Updated 7 months ago
- faster parallel inference of mochi-1 video generation model☆111Updated last month
- ☆60Updated 9 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆195Updated last week
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆179Updated 4 months ago
- ☆110Updated 4 months ago
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆194Updated last month
- An 8-step inversion and 8-step editing process works effectively with the FLUX-dev model. (3x speedup with results that are comparable or…☆226Updated 3 weeks ago
- Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models☆244Updated 4 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆381Updated 5 months ago
- Official Implementation for "Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing"☆52Updated 5 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆289Updated last month
- [NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance☆123Updated 4 months ago
- ☆125Updated 2 months ago
- ☆66Updated 4 months ago
- [ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745☆231Updated 3 months ago
- KandinskyVideo — multilingual end-to-end text2video latent diffusion model☆180Updated 8 months ago
- ☆43Updated last month
- Code for FreeScale, a tuning-free method for higher-resolution visual generation☆114Updated last month
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆131Updated 4 months ago
- Keyframe Interpolation with CogvideoX☆115Updated 3 months ago
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning☆78Updated 2 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆319Updated this week
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/☆110Updated last month
- Official Implementation of PairCustomization SIGGRAPH Asia 2024☆96Updated last week