youngsheen / SimVQ
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
☆225 · Updated 2 months ago
Alternatives and similar repositories for SimVQ:
Users who are interested in SimVQ are comparing it to the libraries listed below:
- A Pytorch Implementation of Finite Scalar Quantization ☆113 · Updated last year
- Scaling Diffusion Transformers with Mixture of Experts ☆291 · Updated 6 months ago
- Keras implementation of Finite Scalar Quantization ☆70 · Updated last year
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491). ☆127 · Updated 8 months ago
- [ICCV 2023] Online Clustered Codebook ☆160 · Updated 5 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback" ☆22 · Updated last week
- Implementation of Autoregressive Diffusion in Pytorch ☆360 · Updated 4 months ago
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency" ☆195 · Updated last month
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer ☆196 · Updated last week
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆188 · Updated 5 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna… ☆167 · Updated last year
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application ☆260 · Updated last month
- Hand-building Flow Matching (Rectified Flow) from scratch ☆301 · Updated 3 months ago
- "FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching". FlowAR employs the simplest scale design and is compatible with an… ☆90 · Updated 2 months ago
- High-performance Image Tokenizers for VAR and AR ☆211 · Updated this week
- Towards training VQ-VAE models robustly! ☆54 · Updated 2 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024] ☆78 · Updated 3 months ago
- 🔥 Stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆85 · Updated 8 months ago
- Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models ☆175 · Updated 9 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆64 · Updated 6 months ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions. ☆110 · Updated 2 months ago
- Implementation of rectified flow and some of its followup research / improvements in Pytorch ☆255 · Updated last month