Train VAE like a boss
☆313Oct 21, 2024Updated last year
Alternatives and similar repositories for vqgan-training
Users that are interested in vqgan-training are comparing it to the libraries listed below
Sorting:
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆634Jul 1, 2024Updated last year
- ☆23Oct 15, 2024Updated last year
- WIP☆94Aug 13, 2024Updated last year
- ☆33Nov 4, 2024Updated last year
- Author's Implementation for E-LatentLPIPS☆179Nov 5, 2024Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆435Aug 9, 2023Updated 2 years ago
- ☆34Sep 10, 2024Updated last year
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,553Mar 16, 2025Updated 11 months ago
- RS-IMLE☆44Dec 7, 2024Updated last year
- ☆30Oct 7, 2024Updated last year
- This repo contains the code for 1D tokenizer and generator☆1,117Mar 20, 2025Updated 11 months ago
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models☆134Feb 27, 2025Updated last year
- EDM2 and Autoguidance -- Official PyTorch implementation☆822Dec 9, 2024Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers☆996Nov 25, 2025Updated 3 months ago
- Focused on fast experimentation and simplicity☆80Dec 24, 2024Updated last year
- ☆27Aug 1, 2024Updated last year
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,550Jan 12, 2025Updated last year
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆1,096Dec 22, 2025Updated 2 months ago
- A suite of image and video neural tokenizers☆1,711Feb 11, 2025Updated last year
- Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer☆443Jul 5, 2024Updated last year
- Unofficial implementation of 2D ProlificDreamer☆145Jan 6, 2025Updated last year
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆298Jun 2, 2025Updated 9 months ago
- Tiny AutoEncoder for Stable Diffusion (and other image models)☆888Jan 23, 2026Updated last month
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆86Jul 28, 2024Updated last year
- Tiny AutoEncoder for Stable Diffusion Videos☆36Oct 5, 2024Updated last year
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…☆4,167Jan 5, 2026Updated last month
- ☆27May 3, 2024Updated last year
- ☆16Apr 7, 2024Updated last year
- Official implementation of Inductive Moment Matching☆574Jul 11, 2025Updated 7 months ago
- Official implementation of "Perturbed-Attention Guidance"☆60Jul 2, 2024Updated last year
- ☆33Aug 9, 2024Updated last year
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆638Oct 16, 2025Updated 4 months ago
- Minute-long video generation at 24FPS.☆50Feb 2, 2026Updated last month
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆1,402Dec 16, 2025Updated 2 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆46Nov 17, 2024Updated last year
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆329Jun 8, 2025Updated 8 months ago
- Elucidating the Design Space of Diffusion-Based Generative Models (EDM)☆1,915Mar 16, 2024Updated last year
- ☆28Mar 4, 2025Updated last year
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,936Aug 15, 2024Updated last year