cloneofsimo/vqgan-training

Train VAE like a boss

☆313

Alternatives and similar repositories for vqgan-training

Users who are interested in vqgan-training are comparing it to the libraries listed below.

cloneofsimo / minRF
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
☆641 · Jul 1, 2024 · Updated 2 years ago
cloneofsimo / minDinoV2
☆24 · Oct 15, 2024 · Updated last year
cloneofsimo / scaling-guide
WIP
☆96 · Aug 13, 2024 · Updated last year
cloneofsimo / repa-rf
☆32 · Nov 4, 2024 · Updated last year
cloneofsimo / infinite-fractal-stream
☆30 · Oct 7, 2024 · Updated last year
cloneofsimo / zeroshampoo
☆33 · Sep 10, 2024 · Updated last year
cloneofsimo / minSDXL
Huggingface-compatible SDXL Unet implementation that is readily hackable
☆439 · Aug 9, 2023 · Updated 2 years ago
fal-ai / diffusion-speedrun
Focused on fast experimentation and simplicity
☆77 · Dec 24, 2024 · Updated last year
mingukkang / elatentlpips
Author's Implementation for E-LatentLPIPS
☆182 · Nov 5, 2024 · Updated last year
SwayStar123 / SpeedrunDiT
SR-DiT Speedrunning ImageNet Diffusion
☆139 · Apr 6, 2026 · Updated 3 months ago
bytedance / 1d-tokenizer
This repo contains the code for 1D tokenizer and generator
☆1,167 · Mar 20, 2025 · Updated last year
cloneofsimo / min-max-in-dit
☆27 · May 3, 2024 · Updated 2 years ago
NVlabs / edm2
EDM2 and Autoguidance -- Official PyTorch implementation
☆847 · Dec 9, 2024 · Updated last year
Kai-46 / minFM
☆175 · Oct 27, 2025 · Updated 8 months ago
cloneofsimo / minSAE
☆30 · Dec 2, 2024 · Updated last year
enkeejunior1 / min-pi-flow
☆56 · Nov 6, 2025 · Updated 8 months ago
sihyun-yu / REPA
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,680 · Mar 16, 2025 · Updated last year
CompVis / attribute-control
Fine-Grained Subject-Specific Attribute Expression Control in T2I Models
☆136 · Feb 27, 2025 · Updated last year
cloneofsimo / efae
☆24 · Jun 18, 2024 · Updated 2 years ago
cloneofsimo / ezmup
Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only; don't use it for Adam.
☆88 · Jul 28, 2024 · Updated last year
ethansmith2000 / AutoLoRADiscovery
☆28 · Aug 1, 2024 · Updated last year
End2End-Diffusion / REPA-E
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
☆511 · Dec 6, 2025 · Updated 7 months ago
SonyResearch / micro_diffusion
Official repository for our work on micro-budget training of large-scale diffusion models.
☆1,589 · Jan 12, 2025 · Updated last year
kvfrans / jax-flow
Flow-matching algorithms in JAX
☆119 · Aug 12, 2024 · Updated last year
willisma / SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
☆1,190 · Dec 22, 2025 · Updated 7 months ago
TencentARC / SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,018 · Nov 25, 2025 · Updated 8 months ago
apple / ml-flextok
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆322 · Jun 2, 2025 · Updated last year
lumalabs / imm
Official implementation of Inductive Moment Matching
☆585 · Jul 11, 2025 · Updated last year
DataCTE / SDXL-Training-Improvements
📊 Research-focused SDXL training framework exploring novel optimization approaches. Goals include enhanced image quality, training stabi…
☆21 · Jun 7, 2025 · Updated last year
SerChirag / rs-imle
RS-IMLE
☆44 · Dec 7, 2024 · Updated last year
NVIDIA / Cosmos-Tokenizer
A suite of image and video neural tokenizers
☆1,732 · Feb 11, 2025 · Updated last year
ethansmith2000 / fsdp_optimizers
Supporting PyTorch FSDP for optimizers
☆84 · Dec 8, 2024 · Updated last year
philippe-eecs / small-vision
A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.
☆34 · Jun 26, 2024 · Updated 2 years ago
hustvl / LightningDiT
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,510 · Dec 16, 2025 · Updated 7 months ago
madebyollin / taesd
Tiny AutoEncoder for Stable Diffusion (and other image models)
☆956 · Jan 23, 2026 · Updated 6 months ago
openai / consistencydecoder
Consistency Distilled Diff VAE
☆2,213 · Nov 7, 2023 · Updated 2 years ago
zai-org / Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
☆448 · Jul 5, 2024 · Updated 2 years ago
lucidrains / titok-pytorch
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
☆184 · Jun 20, 2024 · Updated 2 years ago
Lakonik / LakonLab
Official implementation of AsymFlow, pi-Flow, GMFlow
☆453 · Jul 14, 2026 · Updated last week