zbr17 / OptVQ
Towards training VQ-VAE models robustly!
☆42 Updated this week
Alternatives and similar repositories for OptVQ:
Users who are interested in OptVQ are comparing it to the libraries listed below.
- ☆121 Updated 3 weeks ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an… ☆67 Updated 2 weeks ago
- ☆43 Updated last week
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆55 Updated 2 months ago
- This is the official implementation for ControlVAR. ☆84 Updated last month
- ☆42 Updated last week
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens" ☆36 Updated 3 weeks ago
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu… ☆118 Updated last week
- [arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆185 Updated this week
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆26 Updated 2 months ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆75 Updated last month
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024] ☆75 Updated last month
- ☆66 Updated last month
- Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆62 Updated 3 months ago
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/ ☆103 Updated last week
- Open implementation of "RandAR" ☆46 Updated last week
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba" ☆118 Updated 4 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆80 Updated 2 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu ☆43 Updated last month
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆22 Updated last month
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models ☆36 Updated 3 months ago
- XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation ☆173 Updated last month
- Diffusion Powers Video Tokenizer for Comprehension and Generation ☆38 Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations ☆131 Updated 6 months ago
- Codebase for the paper "Elucidating the design space of language models for image generation" ☆45 Updated last month
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆62 Updated 2 months ago
- Liquid: Language Models are Scalable Multi-modal Generators ☆57 Updated 3 weeks ago
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation ☆77 Updated this week
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers ☆32 Updated last month
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models" ☆66 Updated 2 weeks ago