thuanz123 / enhancing-transformers
An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch
☆302Updated last month
Alternatives and similar repositories for enhancing-transformers:
Users who are interested in enhancing-transformers are comparing it to the libraries listed below
- ☆462Updated 2 years ago
- Official Jax Implementation of MaskGIT☆509Updated 2 years ago
- Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)☆437Updated last year
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆175Updated last year
- MoVQGAN - model for the image encoding and reconstruction☆233Updated last year
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer☆226Updated 3 weeks ago
- [ICCV 2023] Online Clustered Codebook☆171Updated 7 months ago
- Code for the paper Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models (ICLR 2022 Outsta…☆171Updated 2 years ago
- Unofficial Implementation of Consistency Models in pytorch☆255Updated 2 years ago
- Code for the ECCV 2022 paper "Unleashing Transformers"☆184Updated 2 years ago
- Code for Fast Training of Diffusion Models with Masked Transformers☆401Updated 11 months ago
- ☆283Updated 6 months ago
- [ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy☆247Updated 4 months ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆562Updated last year
- Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …☆281Updated last year
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆203Updated last year
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application☆277Updated 3 months ago
- Consistency Models Made Easy☆277Updated 6 months ago
- Implementation of MagViT2 Tokenizer in Pytorch☆601Updated 3 months ago
- [CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space☆315Updated 11 months ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆404Updated last year
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆282Updated 6 months ago
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆151Updated 10 months ago
- CVPR 2022☆146Updated 9 months ago
- [ICLR 2023] DEIS: Fast Sampling of Diffusion Models with Exponential Integrator☆157Updated 2 years ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆351Updated 3 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆111Updated last year
- The official implementation for Pseudo Numerical Methods for Diffusion Models on Manifolds (PNDM, PLMS | ICLR2022)☆343Updated 2 years ago
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆182Updated 2 years ago
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)☆116Updated last month