thuanz123 / enhancing-transformers
An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch
☆300Updated last week
Alternatives and similar repositories for enhancing-transformers:
Users that are interested in enhancing-transformers are comparing it to the libraries listed below
- Official Jax Implementation of MaskGIT☆503Updated 2 years ago
- ☆459Updated 2 years ago
- Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)☆435Updated last year
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆172Updated last year
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer☆217Updated last week
- MoVQGAN - model for the image encoding and reconstruction☆229Updated last year
- Code for Fast Training of Diffusion Models with Masked Transformers☆398Updated 11 months ago
- [ICCV 2023] Online Clustered Codebook☆165Updated 6 months ago
- Code for the ECCV 2022 paper "Unleashing Transformers"☆183Updated 2 years ago
- Unofficial Implementation of Consistency Models in pytorch☆254Updated 2 years ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆557Updated 11 months ago
- [ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy☆244Updated 4 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆435Updated 2 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆338Updated 3 months ago
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆203Updated last year
- Implementation of MagViT2 Tokenizer in Pytorch☆599Updated 3 months ago
- [CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space☆315Updated 11 months ago
- Consistency Models Made Easy☆278Updated 6 months ago
- Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …☆282Updated 11 months ago
- Code for the paper Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models (ICLR 2022 Outsta…☆171Updated 2 years ago
- This repo contains the code for 1D tokenizer and generator☆821Updated 3 weeks ago
- ☆91Updated 2 weeks ago
- ☆279Updated 6 months ago
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆145Updated 10 months ago
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application☆268Updated 2 months ago
- Implementation of rectified flow and some of its followup research / improvements in Pytorch☆277Updated 2 months ago
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis☆557Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆281Updated 5 months ago
- CVPR 2022☆146Updated 9 months ago
- ☆122Updated 9 months ago