Shubhamai / pytorch-vqgan
This repo contains a from-scratch PyTorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis). I have added support for custom datasets, testing, experiment tracking, etc.
☆40Aug 20, 2024Updated last year
Alternatives and similar repositories for pytorch-vqgan
Users who are interested in pytorch-vqgan are comparing it to the repositories listed below
- Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)☆544Jul 17, 2024Updated last year
- The Transformer in PyTorch☆13Aug 7, 2024Updated last year
- ☆18Oct 19, 2024Updated last year
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆82Feb 7, 2026Updated last week
- ☆40Jun 6, 2025Updated 8 months ago
- ☆141Jun 28, 2024Updated last year
- Denoising Diffusion Probabilistic Models (DDPM)☆20Oct 11, 2022Updated 3 years ago
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch☆321Apr 7, 2025Updated 10 months ago
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)☆28Feb 14, 2024Updated 2 years ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆33Dec 15, 2023Updated 2 years ago
- Codebase for the paper "Elucidating the design space of language models for image generation"☆46Nov 17, 2024Updated last year
- A Pytorch Implementation of Finite Scalar Quantization☆176Nov 29, 2023Updated 2 years ago
- Accuracy 77%. Large batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional acc…☆38Jun 1, 2021Updated 4 years ago
- A protein language model for learning the SARS-CoV-2 fitness landscape☆12Apr 22, 2025Updated 9 months ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning☆14Jul 9, 2025Updated 7 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 4 months ago
- A Django web project for subjective image quality assessment by pairwise comparison☆10Sep 24, 2022Updated 3 years ago
- TensorRT In Docker☆11Dec 7, 2024Updated last year
- Deep Generative Models: Diffusion Models for Molecule Generation☆10Jun 17, 2024Updated last year
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Data Programming for Text Detection in Documents using SPEAR☆12Mar 26, 2025Updated 10 months ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- Implementation of the DeepHit model for survival analysis with competing risks using PyTorch☆14Sep 20, 2024Updated last year
- Implementation of various handwritten text line segmentation☆10Jan 6, 2020Updated 6 years ago
- Cloud Computing for Science and Engineering web site☆14Dec 7, 2017Updated 8 years ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 4 months ago
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu…☆13Sep 8, 2023Updated 2 years ago
- This is the official repository for the ICLR 2025 Conference Paper - Fast and Slow Streams for Online Time Series Forecasting without Inf…☆14Apr 30, 2025Updated 9 months ago
- Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra☆11Jan 30, 2025Updated last year
- Implementation of the DocLLM paper for Llama models.☆13Apr 6, 2025Updated 10 months ago
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated 10 months ago
- Workshop that will take you from Graph Neural Networks (GNNs) to Transformers, architectures which have led to numerous breakthrough achi…☆13Sep 11, 2023Updated 2 years ago
- [WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913☆12Apr 15, 2025Updated 10 months ago
- A starting point from which digital twins can be developed.☆11Apr 22, 2024Updated last year
- ☆12Dec 15, 2022Updated 3 years ago
- Python Radiative Transfer in a Bayesian Framework☆14Nov 11, 2025Updated 3 months ago
- HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)☆13Nov 4, 2025Updated 3 months ago
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition☆15Jan 21, 2025Updated last year