thuanz123/enhancing-transformers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thuanz123/enhancing-transformers)

thuanz123 / enhancing-transformers

An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch

☆325

Alternatives and similar repositories for enhancing-transformers

Users that are interested in enhancing-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

patil-suraj / vit-vqgan
View on GitHub
JAX implementation ViT-VQGAN
☆82Sep 21, 2022Updated 3 years ago
kakaobrain / rq-vae-transformer
View on GitHub
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
☆1,028Jan 3, 2024Updated 2 years ago
valeoai / Halton-MaskGIT
View on GitHub
[ICLR2025] Halton Scheduler for Masked Generative Image Transformer
☆286Oct 28, 2025Updated 8 months ago
lucidrains / parti-pytorch
View on GitHub
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆537Dec 8, 2023Updated 2 years ago
CompVis / taming-transformers
View on GitHub
Taming Transformers for High-Resolution Image Synthesis
☆6,520Jul 30, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lucidrains / vector-quantize-pytorch
View on GitHub
Vector (and Scalar) Quantization, in Pytorch
☆3,987Updated this week
dome272 / VQGAN-pytorch
View on GitHub
Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
☆552Jul 17, 2024Updated 2 years ago
dome272 / MaskGIT-pytorch
View on GitHub
Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
☆473Sep 3, 2023Updated 2 years ago
samb-t / unleashing-transformers
View on GitHub
Code for the ECCV 2022 paper "Unleashing Transformers"
☆186Apr 17, 2023Updated 3 years ago
google-research / maskgit
View on GitHub
Official Jax Implementation of MaskGIT
☆562Nov 18, 2022Updated 3 years ago
CrossmodalGroup / DynamicVectorQuantization
View on GitHub
Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…
☆194Jul 23, 2023Updated 3 years ago
cientgu / VQ-Diffusion
View on GitHub
☆487Jun 30, 2022Updated 4 years ago
lyndonzheng / CVQ-VAE
View on GitHub
[ICCV 2023] Online Clustered Codebook
☆189Sep 19, 2024Updated last year
Qiyuan-Ge / PaintMind
View on GitHub
Fast and controllable text-to-image model.
☆41Jun 16, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TencentARC / SEED-Voken
View on GitHub
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,018Nov 25, 2025Updated 8 months ago
kakaobrain / hqtransformer
View on GitHub
Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)
☆29Feb 14, 2024Updated 2 years ago
ai-forever / MoVQGAN
View on GitHub
MoVQGAN - model for the image encoding and reconstruction
☆266Oct 31, 2023Updated 2 years ago
microsoft / VQ-Diffusion
View on GitHub
Official implementation of VQ-Diffusion
☆981Apr 17, 2024Updated 2 years ago
lxa9867 / ImageFolder
View on GitHub
High-performance Image Tokenizers for VAR and AR
☆307Apr 25, 2025Updated last year
zh460045050 / VQGAN-LC
View on GitHub
☆145Jun 28, 2024Updated 2 years ago
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,167Mar 20, 2025Updated last year
LTH14 / mage
View on GitHub
A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
☆582Mar 10, 2023Updated 3 years ago
huggingface / open-muse
View on GitHub
Open reproduction of MUSE for fast text2image generation.
☆358Jun 1, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
LTH14 / mar
View on GitHub
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
☆1,943Feb 20, 2026Updated 5 months ago
lucidrains / muse-maskgit-pytorch
View on GitHub
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
☆918Feb 29, 2024Updated 2 years ago
lucidrains / magvit2-pytorch
View on GitHub
Implementation of MagViT2 Tokenizer in Pytorch
☆668Jan 12, 2025Updated last year
joanrod / ocr-vqgan
View on GitHub
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…
☆84Jan 30, 2023Updated 3 years ago
SilentView / GigaTok
View on GitHub
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204Jan 7, 2026Updated 6 months ago
dome272 / Paella
View on GitHub
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
☆748Oct 4, 2023Updated 2 years ago
VinAIResearch / SwiftBrush
View on GitHub
SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation (CVPR 2024)
☆72Jun 24, 2026Updated last month
buxiangzhiren / Asymmetric_VQGAN
View on GitHub
☆241Jul 24, 2023Updated 3 years ago
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960Aug 15, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
hmorimitsu / maskgit-torch
View on GitHub
A MaskGIT port from JAX to PyTorch
☆18Jun 18, 2022Updated 4 years ago
turingmotors / One-D-Piece
View on GitHub
[ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
☆83Jul 30, 2025Updated 11 months ago
sony / sqvae
View on GitHub
Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)
☆196Jul 20, 2022Updated 4 years ago
rosinality / vq-vae-2-pytorch
View on GitHub
Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch
☆1,802Feb 15, 2023Updated 3 years ago
AILab-CVC / SEED
View on GitHub
Official implementation of SEED-LLaMA (ICLR 2024).
☆642Sep 21, 2024Updated last year
songweige / TATS
View on GitHub
Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …
☆288May 1, 2024Updated 2 years ago
facebookresearch / DiT
View on GitHub
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,689May 31, 2024Updated 2 years ago