zbr17 / OptVQ
Towards training VQ-VAE models robustly!
☆68Updated 3 months ago
Alternatives and similar repositories for OptVQ:
Users that are interested in OptVQ are comparing it to the libraries listed below
- TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge☆89Updated last week
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆104Updated 3 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆79Updated 4 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆110Updated 2 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆172Updated last month
- ☆176Updated 2 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆66Updated 5 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆61Updated this week
- ☆156Updated 3 months ago
- Official Implementation for Diffusion Models Without Classifier-free Guidance☆110Updated last month
- EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆90Updated last month
- ☆30Updated last month
- Gaussian Mixture Flow Matching Models (GMFlow)☆46Updated last week
- This is the official implementation for ControlVAR.☆101Updated 4 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆56Updated last month
- DDT: Decoupled Diffusion Transformer☆97Updated this week
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆69Updated 3 weeks ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆54Updated last month
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆37Updated 6 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆146Updated 3 weeks ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆42Updated 2 weeks ago
- [CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆161Updated last month
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆48Updated 4 months ago
- ☆121Updated 9 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆66Updated 6 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆47Updated last week
- ☆45Updated last month
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆66Updated last month
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆99Updated 3 weeks ago
- ☆70Updated 4 months ago