kakaobrain / hqtransformerLinks
Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)
☆28Updated last year
Alternatives and similar repositories for hqtransformer
Users that are interested in hqtransformer are comparing it to the libraries listed below
Sorting:
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆63Updated 2 years ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆191Updated 2 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆58Updated 2 years ago
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆84Updated last year
- ☆141Updated last year
- The official PyTorch implementation of Fast Diffusion Model☆96Updated 2 years ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆127Updated last year
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models'☆49Updated 8 months ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Updated 3 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆40Updated last year
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 3 years ago
- CVPR 2022☆152Updated last year
- Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023☆91Updated 11 months ago
- Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"☆98Updated last year
- [ICCV 2023] Online Clustered Codebook☆181Updated last year
- Official repo for Discriminator Guidance.☆168Updated last year
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆27Updated 11 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆51Updated last year
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆88Updated 9 months ago
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models☆175Updated 2 years ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆197Updated last month
- Implementation of Binary Latent Diffusion☆51Updated 2 years ago
- Code for the Paper "Improving Diffusion Model Efficiency Through Patching"☆114Updated last year
- ☆19Updated last year
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023☆196Updated 2 years ago
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆28Updated 2 years ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆124Updated 3 years ago
- Transformer-Mamba Diffusion Models☆119Updated last year
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆53Updated 3 months ago