zh460045050 / VQGAN-LC
☆113Updated 7 months ago
Alternatives and similar repositories for VQGAN-LC:
Users that are interested in VQGAN-LC are comparing it to the libraries listed below
- ☆138Updated last month
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆167Updated last year
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆86Updated last month
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆59Updated 3 months ago
- ☆123Updated this week
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆46Updated 2 weeks ago
- Scaling Diffusion Transformers with Mixture of Experts☆252Updated 5 months ago
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆126Updated 8 months ago
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆68Updated last month
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆77Updated 2 months ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆122Updated last month
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆62Updated 4 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆57Updated last year
- ☆71Updated 4 months ago
- [Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions☆30Updated last week
- MoVQGAN - model for the image encoding and reconstruction☆218Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆135Updated 7 months ago
- [ICLR25] High-performance Image Tokenizers for VAR and AR☆194Updated this week
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆181Updated 4 months ago
- This is the official implementation for ControlVAR.☆94Updated 2 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆36Updated 4 months ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated 3 weeks ago
- unofficial MaskGIT reproduction in PyTorch☆183Updated last year
- ☆43Updated 5 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆137Updated this week
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆54Updated last month
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆94Updated 3 months ago
- Towards training VQ-VAE models robustly!☆50Updated last month
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 7 months ago