cloneofsimo / vqgan-trainingLinks
Train VAE like a boss
☆279Updated 7 months ago
Alternatives and similar repositories for vqgan-training
Users that are interested in vqgan-training are comparing it to the libraries listed below
Sorting:
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆562Updated 11 months ago
- Official implementation of Inductive Moment Matching☆475Updated 2 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆361Updated 4 months ago
- FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral)☆231Updated 6 months ago
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆167Updated 2 months ago
- The aim of this repository is to test and implement Flow-Matching-based models☆92Updated 4 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆105Updated 2 months ago
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆340Updated 8 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆149Updated 3 months ago
- Implementation of rectified flow and some of its followup research / improvements in Pytorch☆292Updated last month
- Focused on fast experimentation and simplicity☆73Updated 5 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆423Updated last year
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆173Updated 11 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆118Updated 4 months ago
- Code for Fast Training of Diffusion Models with Masked Transformers☆403Updated last year
- Text to Image Latent Diffusion using a Transformer core☆184Updated 9 months ago
- ☆47Updated 3 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆126Updated last year
- ☆20Updated 7 months ago
- WIP☆93Updated 9 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆285Updated 7 months ago
- EDM2 and Autoguidance -- Official PyTorch implementation☆714Updated 5 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆111Updated 3 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆206Updated last month
- Inference-time scaling of diffusion-based image and video generation models.☆144Updated 2 months ago
- Consistency Models Made Easy☆284Updated 7 months ago
- Official Implementation of weights2weights☆140Updated 2 months ago
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight☆138Updated last year
- ☆51Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆396Updated 3 months ago