elyxlz / givt-pytorch
A partial implementation of the Generative Infinite-Vocabulary Transformer (GIVT) from Google DeepMind, in PyTorch.
☆20Updated last year
Alternatives and similar repositories for givt-pytorch
Users who are interested in givt-pytorch are comparing it to the libraries listed below.
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer☆272Updated last week
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆157Updated 6 months ago
- ☆281Updated 5 months ago
- Pytorch implementation for MeanFlow☆223Updated 3 months ago
- ☆139Updated last year
- ☆151Updated 7 months ago
- Pytorch implementation of MeanFlow on ImageNet and CIFAR10☆331Updated 2 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image…☆61Updated 2 years ago
- Towards training VQ-VAE models robustly!☆85Updated 3 months ago
- High-performance Image Tokenizers for VAR and AR☆293Updated 6 months ago
- A Pytorch Implementation of Finite Scalar Quantization☆164Updated last year
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆146Updated 3 months ago
- The official implementation of "[MASK] is All You Need"☆125Updated 3 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆124Updated last year
- [ICCV 2023] Online Clustered Codebook☆178Updated last year
- Shortcut flow matching Pytorch implementation☆65Updated 10 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆238Updated 3 weeks ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆223Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆72Updated last year
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆102Updated last year
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆408Updated last week
- ☆183Updated 10 months ago
- Scalable Diffusion Models with State Space Backbone☆156Updated last year
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆157Updated 9 months ago
- Implementation of Autoregressive Diffusion in Pytorch☆416Updated last year
- official training and inference code of bitwise tokenizer☆51Updated 5 months ago
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆178Updated last year
- Open source implementation of "Vision Transformers Need Registers"☆198Updated 2 weeks ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆158Updated 4 months ago
- Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"☆183Updated 4 months ago