youngsheen / SimVQ
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
☆227 Updated 2 months ago
Alternatives and similar repositories for SimVQ:
Users who are interested in SimVQ are comparing it to the libraries listed below:
- A Pytorch Implementation of Finite Scalar Quantization ☆115 Updated last year
- Scaling Diffusion Transformers with Mixture of Experts ☆294 Updated 6 months ago
- ☆120 Updated 9 months ago
- ☆85 Updated this week
- Keras implement of Finite Scalar Quantization ☆71 Updated last year
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application ☆259 Updated last month
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491). ☆127 Updated 8 months ago
- [ICCV 2023] Online Clustered Codebook ☆162 Updated 6 months ago
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency" ☆199 Updated 2 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆143 Updated 3 weeks ago
- ☆167 Updated last month
- Implementation of Autoregressive Diffusion in Pytorch ☆364 Updated 4 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna… ☆167 Updated last year
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆192 Updated 5 months ago
- High-performance Image Tokenizers for VAR and AR ☆226 Updated last week
- Implementation of rectified flow and some of its followup research / improvements in Pytorch ☆265 Updated last month
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆87 Updated 9 months ago
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer ☆204 Updated last month
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an… ☆95 Updated 3 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback" ☆23 Updated 3 weeks ago
- This is a repo to track the latest autoregressive visual generation papers. ☆178 Updated this week
- Towards training VQ-VAE models robustly! ☆62 Updated 2 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆292 Updated 3 weeks ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need ☆202 Updated 2 weeks ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024] ☆77 Updated 4 months ago
- ☆147 Updated 3 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image… ☆57 Updated last year
- ☆275 Updated 5 months ago
- Unofficial implementation of "Simplifying, Stabilizing & Scaling Continuous-Time Consistency Models" for MNIST ☆41 Updated this week
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆64 Updated 4 months ago