youngsheen / SimVQ
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
☆250 Updated 4 months ago
Alternatives and similar repositories for SimVQ:
Users who are interested in SimVQ are comparing it to the repositories listed below.
- A PyTorch Implementation of Finite Scalar Quantization ☆128 Updated last year
- Scaling Diffusion Transformers with Mixture of Experts ☆314 Updated 7 months ago
- ☆122 Updated 10 months ago
- [ICCV 2023] Online Clustered Codebook ☆171 Updated 7 months ago
- ☆98 Updated last month
- Keras implementation of Finite Scalar Quantization ☆71 Updated last year
- TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge ☆106 Updated this week
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491). ☆127 Updated 9 months ago
- Official PyTorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna… ☆175 Updated last year
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency" ☆210 Updated 3 months ago
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application ☆277 Updated 3 months ago
- ☆180 Updated 2 months ago
- Implementation of Autoregressive Diffusion in PyTorch ☆376 Updated 6 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆199 Updated last week
- High-performance Image Tokenizers for VAR and AR ☆255 Updated last week
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer ☆226 Updated 3 weeks ago
- Implementation of rectified flow and some of its followup research / improvements in PyTorch ☆285 Updated last week
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆202 Updated 7 months ago
- Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers ☆190 Updated 3 weeks ago
- Unofficial implementation of "Simplifying, Stabilizing & Scaling Continuous-Time Consistency Models" for MNIST ☆56 Updated last month
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆92 Updated 10 months ago
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization ☆151 Updated 10 months ago
- ☆127 Updated last year
- ☆283 Updated 6 months ago
- Towards training VQ-VAE models robustly! ☆72 Updated 3 months ago
- ☆159 Updated 4 months ago
- Implementation of MagViT2 Tokenizer in PyTorch ☆601 Updated 3 months ago
- MoVQGAN - model for the image encoding and reconstruction ☆233 Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆872 Updated 2 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an… ☆107 Updated this week