An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch
☆322Apr 7, 2025Updated 11 months ago
Alternatives and similar repositories for enhancing-transformers
Users that are interested in enhancing-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JAX implementation ViT-VQGAN☆82Sep 21, 2022Updated 3 years ago
- The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)☆1,010Jan 3, 2024Updated 2 years ago
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer☆282Oct 28, 2025Updated 4 months ago
- Taming Transformers for High-Resolution Image Synthesis☆6,455Jul 30, 2024Updated last year
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆537Dec 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)☆546Jul 17, 2024Updated last year
- Vector (and Scalar) Quantization, in Pytorch☆3,872Feb 12, 2026Updated last month
- Code for the ECCV 2022 paper "Unleashing Transformers"☆185Apr 17, 2023Updated 2 years ago
- Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)☆471Sep 3, 2023Updated 2 years ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆191Jul 23, 2023Updated 2 years ago
- Official Jax Implementation of MaskGIT☆559Nov 18, 2022Updated 3 years ago
- ☆486Jun 30, 2022Updated 3 years ago
- [ICCV 2023] Online Clustered Codebook☆184Sep 19, 2024Updated last year
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)☆29Feb 14, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SEED-Voken: A Series of Powerful Visual Tokenizers☆999Nov 25, 2025Updated 4 months ago
- Fast and controllable text-to-image model.☆41Jun 16, 2023Updated 2 years ago
- ☆143Jun 28, 2024Updated last year
- High-performance Image Tokenizers for VAR and AR☆303Apr 25, 2025Updated 11 months ago
- MoVQGAN - model for the image encoding and reconstruction☆264Oct 31, 2023Updated 2 years ago
- Official implementation of VQ-Diffusion☆978Apr 17, 2024Updated last year
- This repo contains the code for 1D tokenizer and generator☆1,134Mar 20, 2025Updated last year
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis☆576Mar 10, 2023Updated 3 years ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,883Feb 20, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Open reproduction of MUSE for fast text2image generation.☆359Jun 1, 2024Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆83Jan 30, 2023Updated 3 years ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆202Jan 7, 2026Updated 2 months ago
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆748Oct 4, 2023Updated 2 years ago
- SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation (CVPR 2024)☆72Aug 6, 2025Updated 7 months ago
- ☆239Jul 24, 2023Updated 2 years ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,940Aug 15, 2024Updated last year
- A MaskGIT port from JAX to PyTorch☆18Jun 18, 2022Updated 3 years ago
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆77Jul 30, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of SEED-LLaMA (ICLR 2024).☆642Sep 21, 2024Updated last year
- ☆145Feb 27, 2024Updated 2 years ago
- Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch☆1,797Feb 15, 2023Updated 3 years ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,450May 31, 2024Updated last year
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆308Jun 2, 2025Updated 9 months ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆605Oct 6, 2024Updated last year
- Official implementation of Diffusion Autoencoders☆959Sep 12, 2024Updated last year