An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch
☆324 · Apr 7, 2025 · Updated last year
Alternatives and similar repositories for enhancing-transformers
Users who are interested in enhancing-transformers are comparing it to the libraries listed below.
- JAX implementation ViT-VQGAN ☆82 · Sep 21, 2022 · Updated 3 years ago
- The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22) ☆1,011 · Jan 3, 2024 · Updated 2 years ago
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer ☆284 · Oct 28, 2025 · Updated 5 months ago
- Taming Transformers for High-Resolution Image Synthesis ☆6,472 · Jul 30, 2024 · Updated last year
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch ☆537 · Dec 8, 2023 · Updated 2 years ago
- Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf) ☆546 · Jul 17, 2024 · Updated last year
- Vector (and Scalar) Quantization, in Pytorch ☆3,896 · Mar 30, 2026 · Updated 2 weeks ago
- Code for the ECCV 2022 paper "Unleashing Transformers" ☆185 · Apr 17, 2023 · Updated 2 years ago
- Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf) ☆472 · Sep 3, 2023 · Updated 2 years ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna… ☆192 · Jul 23, 2023 · Updated 2 years ago
- Official Jax Implementation of MaskGIT ☆558 · Nov 18, 2022 · Updated 3 years ago
- ☆486 · Jun 30, 2022 · Updated 3 years ago
- [ICCV 2023] Online Clustered Codebook ☆185 · Sep 19, 2024 · Updated last year
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer) ☆29 · Feb 14, 2024 · Updated 2 years ago
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆1,002 · Nov 25, 2025 · Updated 4 months ago
- Fast and controllable text-to-image model. ☆41 · Jun 16, 2023 · Updated 2 years ago
- ☆144 · Jun 28, 2024 · Updated last year
- High-performance Image Tokenizers for VAR and AR ☆305 · Apr 25, 2025 · Updated 11 months ago
- MoVQGAN - model for the image encoding and reconstruction ☆264 · Oct 31, 2023 · Updated 2 years ago
- Official implementation of VQ-Diffusion ☆978 · Apr 17, 2024 · Updated last year
- This repo contains the code for 1D tokenizer and generator ☆1,140 · Mar 20, 2025 · Updated last year
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis ☆579 · Mar 10, 2023 · Updated 3 years ago
- Open reproduction of MUSE for fast text2image generation. ☆359 · Jun 1, 2024 · Updated last year
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838 ☆1,898 · Feb 20, 2026 · Updated last month
- Implementation of MagViT2 Tokenizer in Pytorch ☆660 · Jan 12, 2025 · Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch ☆919 · Feb 29, 2024 · Updated 2 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆83 · Jan 30, 2023 · Updated 3 years ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆202 · Jan 7, 2026 · Updated 3 months ago
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2 ☆749 · Oct 4, 2023 · Updated 2 years ago
- SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation (CVPR 2024) ☆72 · Aug 6, 2025 · Updated 8 months ago
- ☆239 · Jul 24, 2023 · Updated 2 years ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,941 · Aug 15, 2024 · Updated last year
- A MaskGIT port from JAX to PyTorch ☆18 · Jun 18, 2022 · Updated 3 years ago
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression ☆78 · Jul 30, 2025 · Updated 8 months ago
- Official implementation of SEED-LLaMA (ICLR 2024). ☆641 · Sep 21, 2024 · Updated last year
- ☆145 · Feb 27, 2024 · Updated 2 years ago
- Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch ☆1,798 · Feb 15, 2023 · Updated 3 years ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length ☆308 · Jun 2, 2025 · Updated 10 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆8,479 · May 31, 2024 · Updated last year